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SOME REMARKS ON SET THEORY 
P. ERDOS 


The present note contains a few disconnected remarks on the 
theory of sets. 

It is well known that the addition of ordinal numbers does not 
satisfy the the law of commutativity, for example, 1+ow+1. 
Let now be a finite number. Denote by f() the maximum number 
of different ordinals we can obtain by adding in all possible ways n 
ordinals. We prove the following theorem. 


THEOREM I. We have 


(1) f(m) = max (k2*-! + 1)f(n — 

rSn-1 
In fact, f(2)=2, f(3)=5, f(4) =13, f(S) =33, f(6) =81, f(7) =193, 
f(8) =449, f(9) =1089, f(10) = 2673, f(11) =6561, f(12) = 15633, f(13) 
=37249, f(14) =88209, f(15) =216153, - - - , and for x23, f(5x+1) 
= 817, f(Sx+2) =193-812-', f(Sx+3) =1937817-*, f(Sx+4) =193*817-5, 
f(S5x+5) =33-81*. Thus for n2=21 


(2) f(n) = 81f(n — 5). 


Let there be given m ordinals a4, a2, - - - , @a. It is well known that 
every ordinal can be written uniquely as the sum of indecomposable 
ordinals. (An ordinal is said to be indecomposable if it is not the sum of 
two smaller ordinals.) Denote by ¢(a) the largest of these indecom- 
posable ordinals belonging to a. (6(a@) may have a coefficient c in the 
decomposition of a.) Put y=minis, ¢(a;), and assume that there are 
ka’s with ¢(a;) =y. Denote these a’s by a1, a2, - - - , ax. If in the sum 
+ai,, i, 2, t, a permutation of 1, 2,---,m, 
none of the a;, i appear at the end, they get absorbed in the fol- 
lowing summands, and we get exactly f(m—k) different sums. Assume 
next that exactly r of the a,’s, rSk, appear at the end of a:,+ai, 


+----+a;,. Put 
(3) 2*y bi, 6; = for t,j k. 
We then have 


(4) ai taint: tay, = B+ (2% + + + 


where B2wy. (All the 4’s except 6;, get absorbed.) Now 2%+2‘- 
+ ---+-+2- can be chosen in C;,- ways and 6;, in r ways. Thus 
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the number of sums is rC;,,f(n—k). (The k—r a’s with (a) =y not 
appearing at the end get absorbed.) Summing for r, we obtain 


(S) +> | — k) = (k2*1 + 1)f(n — 


r=1 
Hence clearly 


(6) f(mn) = max (k2*-! + 1)f(n — k) 

(since it clearly follows from our proof that our choice of the a’s 
gives the maximum number of different summands). This proves (1). 

We obtain from (1) by a simple computation that for »<20 the 
value of f(”) is given by Theorem I. The rest of Theorem I is easily 
proved by induction, we have to use that (k2*-'+1)"* (& integer) 
increases for k=5 and decreases for k=5. We suppress the details 
since they can easily be given and depend only on numerical esti- 
mates. Thus the proof of Theorem I is complete. 

Mr. Spanier remarked that the number of different products one 
can obtain from ordinals is n!. It suffices to choose a;=w+1, 


Q@2=w+2,--+,a,=w+n. A simple computation shows that 
where 4, i2, - is any permutation of 1, 2,---, m. 
Let X be a set of power m. Letters a, b, - - - denote subsets of 
X,A,B, +--+, sets of subsets of X. A and B are defined (by Lusin) to 
be orthogonal if for any a€ A, DCB, af\b has power less than m. The 4 


orthogonal sets are said to be separable if there exist ¢ and d with 
c(\d empty and such that for every a€A, DEB, aCcVa’, bCdUD’, 
where the power of a’ and 0’ is less than m. 

Lusin! proves with the aid of the axiom of choice that if m=No 
there exists two orthogonal sets which are not separable. We shall 
give a very simple proof of this result for all m, which for m=No y 
will be independent of the axiom of choice. 

Let A consist of m disjoint sets of power m and B consist of all the 
sets which intersect the sets of A in not more than one point. Clearly 
A and B are orthogonal, but they clearly are not separable, for every 
set c which intersects all the sets of A contains a set of B. For m=No 
this proof is independent of the axiom of choice, but in the general 
case the equality m*=™m is used and this is equivalent to the axiom 
of choice.? 


1C. R. (Doklady) Acad. Sci. URSS. vol. 40 (1943) pp. 175-178. 
* Tarski, Fund. Math. vol. 7, pp. 147-154. 
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Lusin'! also proves that if both A and B contain only countably 
many sets, then they are separable. His proof generalizes to the case 
when ™m is a regular number and both A and B contain only m sets. 
We give the proof only for the sake of completeness. 

Suppose A consists of the sets {a.} and B of the sets {b.}. Put 


c= Db), d= Da). 
a a 
Clearly c and d separate A and B. 
The proof breaks down if m is singular. We shall give an example 
which shows that the theorem is not always true for singular m. Put 


m= + 24 24... 


Let A consist of countably many disjoint sets a; of power m. Write 
--- where card (n?)=Npo, card - - - (card 
a denotes the cardinal number of a) and nf’On{°=0. B con- 
sists of all the sets b of the form Un", where f(i) is any function of i 
(clearly card B=28e<m). A and B are clearly orthogonal, but a 
simple argument shows that they are not separable. 

If 2 is the smallest cardinal number cofinal to m and A and B con- 
tain not more than sets, then it is easy to see that if A and B are 
orthogonal, they are also separable. 

The orthogonal sets A and B are said to be complete (Lusin) if we 
can add no set either to A or to B without destroying orthogonality, 
that is, if X is the set of integers and A consists of all the subsets con- 
taining only a finite number of even numbers and B of all sets con- 
taining only finitely many odd numbers. 

A and B are called k-orthogonal if for any a@€A and JCB, 
card (a/\b) <S,. k-completeness can be defined in the obvious way. 
We shall prove the following theorem. 


THEOREM II. The cardinal number N of the k-complete orthogonal 
pairs equals 


N = 


We shall assume the generalized continuum hypothesis 2% =Ni4:. 
Tarski® proved (by using the generalized hypothesis of the con- 
tinuum) that there exist m** subsets of X(card X =m) such that the 
intersection of any two has power less than &;. Denote such a set of 
subsets by C. Split C in an arbitrary way into the union of two sets 
of subsets A’ and B’. This can clearly be done in 2™ ways. Clearly 


3 Ibid. vol. 12, pp. 186-206 and vol. 14, pp. 205-216. 
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A’ and B’ are k-orthogonal. It immediately follows from the axiom 
of choice that they can be extended to the k-complete orthogonal pair 
A and B, and a simple argument shows that to different A’ and B’ 
correspond different A and B. This shows that 


Let now A’ be any set of subsets of X. There clearly exists a 
maximal set of subsets of X, B say, which is k-orthogonal to A’. We 
shall prove that there exists a subset A’’ of A’ of power Sm*®* which 
determines the same set B. Consider all the subsets of power N; of 
the sets aA’. The power of these sets is clearly not greater than 
m*®, To each of these sets select an arbitrary a2€ A’ which contains 
it, and let the sets of A’’ be all these a,. Clearly if B is k-orthogonal to 
A”’ then it is also k-orthogonal to A’ (and vice versa), which com- 
pletes the proof of our statement. But then a simple argument shows 
that the number of complete k-orthogonal pairs is not greater than 


(2™) mite 2 me 


which completes the proof of Theorem II. In the second part of our 
proof we clearly did not use the continuum hypothesis. 

It follows from Theorem II that the number of complete orthogonal 
pairs is 2". The number of complete orthogonal and separable pairs 
is clearly only 2”, which again shows that there are orthogonal pairs 
which are not separable. 

Without using the continuum hypothesis it seems to be very hard 
to prove that the number of complete orthogonal pairs is greater 
than 2™. But if m=, this is quite easy, since it is well known that 
there exist 2®° sets of integers such that the intersection of any two 
is finite. It suffices to let a. consist of the integers 2"+ [n*] where a 
is an arbitrary positive real number. 


THEOREM III. Let S be any infinite subset of k-dimensional Euclidean 
space. Card S=m. Then there exists a subset S, of S, card S;=m, such 
that all the distances between any two points of S, are different. 


REMARK. We are not going to assume the continuum hypothesis. 
In fact the proof will be complicated only because we cannot exclude 
the possibility that m is singular. 

We use induction with respect to the dimension k. We slightly 
strengthen the statement of our theorem. In fact we prove: Let the 
set S (card S=m) be situated on m (n<m) k-dimensional hyperplanes 
or hyperspheres. Then there exists a subset S; of S, card S;=m, such 
that all the distances between any two points of S; are different. 
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First we prove this for k=1. Put S=L+C, where L is the subset of 
S situated on lines and C the subset situated on circles. Assume first 
card L=m. Denote by M, the set of lines containing L. By assump- 
tion card M,psn<m. Let L, be a subset of L, such that any line 
bisecting the distance between any two points of J; is not in M,, and 
I, is maximal with respect to this property (it clearly follows from 
the axiom of choice that such an J, exists, possibly it is empty). We 
prove card L,=m. For if not assume that card L;,=r<m, then by 
definition of Z;, to every point x of L—J, there exists a point y of 
ZI, such that the perpendicular bisector of x and y is a line of Mz. 
But a point y of LZ, and a line / of Mz, uniquely determines a point x 
of L—L, such that the perpendicular bisector of the segment [x, y] is /. 
But since card L; <<m andcardM, <™m this would imply that =m, 
and hence card L<m, which is not true. Thus we prove that card Z; 
=m. Let now L, be a maximal subset of LZ; with the property that 
all distances between points of ZL, are different. Let card L.=t. If 
t<m, then all the points of Z,—Z, must lie on ¢ circles or lines. But 
this is impossible since the lines cannot coincide with any of the lines 
of M_z, therefore any of these lines can intersect any line of M, inat 
most one point. Therefore each of these lines can contain at most 
card My, points of L,—JZ:, and each of the circles can contain also 


“at most card My, points of Ii—Z,. But this would mean that 


card (L,—L:) St-card Myi<m which is not the case. Thus t=m, 
which completes the case card L=m. Assume next card C=m. De- 
note by M, the set of circles containing C, put card Mc=r<m, 
denote further by O the set of centers of the circles of Mc,, and con- 
sider the set C—O. Denote by C; a maximal subset of C—O with the 
property that all distances between points of C, are different. Then 
it is easy to see that card C;=m, which completes the proof in case 
k=1. 

Assume now that our theorem is true for k—1, and we shall prove 
it for k. Suppose then that S lies on n<m k-dimensional hyperplanes 
and hyperspheres. By the same argument as used in the previous 
pages we can find m (k —1)-dimensional hyperplanes and hyperspheres 
which contain m points of S, and by the induction hypothesis this 
completes the proof of Theorem III. In case m is a regular number 
it is easy to give a very much simpler proof. 

Assume that m is regular and that S is a set of power m in k-dimen- 
sional Euclidean space. Then we can select a subset S; of S of power 
m such that for r=1, 2, - - - , & the volume of any two r-dimensional 
nondegenerate simplices is different. The proof is similar to the case 
for r=1. If m is singular this result is false. Take m=N., and denote 
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by h, 2, - - - a countable set of parallel lines. Let /, contain S; points 
of S. Clearly any subset S, of S of power &; contains two nondegen- 
erate triangles of the same area. 

If the set S is in Hilbert space our theorems of the previous pages 
do not generalize. In fact Oxtoby and I constructed a set of power &i, 
such that the distance between any two points is rational. We do not 
give the construction here. At present we cannot decide whether 
there exists in Hilbert space such a set of power c. (Added in proof: 
Kakutani and I found a simple example of such a set.) 

One can ask the following question: Is it possible to split the n-di- 
mensional Euclidean space into countably many disjoint sets, such 
that in each set any two distances between any two points should be 
different. For =1 this is known,‘ and I do not know the answer for 
n>1. 

The problems we just considered have some interest also for 
finite sets. Let there be given m points (m finite) on a line. Then we 
can select c-n”* points among them, such that all the distances are 
different. It is probable that c-n'/* can be replaced by c-n/*. If the 
nm points are in k-dimensional space, we can select c-n'// points 
among them such that all the distances are different. I do not know 
the exact value of f(z). 

Dénes Kénig’s’ book on graphs contains the following theorem: 
Let G be a graph of order m (m is an infinite cardinal) and any two 
vertices are connected by less than m edges, and we assume ”<m. 
Then G is the product of linear factors. Let us first explain the ter- 
minology: The order of a point is the cardinal number of edges of its 
star (the star of a vertex is the collection of edges incident with it). 
A graph is said to be of order m, if every vertex of it has order m. 
Let Q, be the initial number belonging to the cardinal number m. G 
is the product of linear factors if we can make correspond to each edge 
of G an ordinal number 8 < Q;, such that the star of every vertex con- 
tains one and only one edge to which the ordinal 8, B=1, 2,---, 
corresponds. Kénig raises the question whether the condition “con- 
nected by less than m edges, and »<m” can be replaced by “connected 
by less than m edges.” The proof given in his book shows that the 
answer is affirmative in case m is a regular number. We shall show 
that the answer is negative for singular numbers. Let the vertices of 
G be the points {a;}, $=1,2,---,and {ba}, a<Q,. The vertices a; 
and a;, i<j, are connected by N; edges, and b, and a; are connected 
by &; edges. Clearly G is of order &. and any two vertices are con- 


‘ P. Erdés and S. Kakutani, Bull. Amer. Math. Soc. vol. 49 (1943) pp. 457-460. 
5 Theorie der endlichen und unendlichen Graphen, pp. 220-223. 
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nected by less than N. edges. Nevertheless G is not the product of 
linear factors, in fact G has no linear factor at all. (A linear factor is 
a subgraph of order 1 containing all the vertices of G.) Clearly any 
linear factor would have all the b’s as vertices, but then at least 
one a would have to have order Ni, which is impossible; this con- 
tradiction proves our assertion. It would be easy to construct an 
analogous counterexample for every other singular number. 
Now we prove the following theorem. 


THEOREM IV. Let G be a graph of order m, where every vertex is con- 


nected (by an edge) to at least m different vertices. Then G is the product 
of linear factors. 


REMARK. This theorem is clearly a generalization of the theorem 
given in Kénig’s book.’ My original proof was very complicated. 
Hajés found the following very much simpler proof: 

Let 2 be the initial number belonging to m, {aa}, a<Q%, be the 
vertices of G and {es}, B<Q, the edges of G. We construct the 
factors of G by transfinite induction. Let 0<y<Q, and suppose 
that for every 5<7y we have already found a linear factor. Then we 
construct the yth linear factor as follows: Let e, be the edge of 
smallest index which has not been used in any of the previous linear 
factors (in other words every ¢,, with p’<p, occurs in some previous 
linear factor). Let then e, be the first edge of our th linear factor. We 
construct the th factor by transfinite induction. Suppose we have 
already constructed a subgraph G,(a@) of order 1 of G containing e, 
and also all the vertices of index less than a of G. Clearly G,(a@) has 
less than m vertices. By our assumption a, is connected to at least m 
different vertices. Thus it is connected to m different vertices not in 
G,(a). In the factors of index less than y we clearly used less than m 
(in fact card yy) edges all emanating from a,. Thus there remains an 
unused edge of G which connects a, to a vertex not in G,(a), this edge 
will be in our yth factor. This construction gives our th factor for 
every y <Q; and clearly by construction every edge of G occurs in 
one of the factors once and only once, q.e.d. This proves Theorem 
IV. 

The following problem is due to Turan® (oral communication): 

Let card S=c, to every aC S there corresponds a finite subset f(a) 
of S. We assume that a is not contained in f(a). Two elements a and 6 


6 The problem was originally raised by Tur4n in connection with a problem on 
interpolation and solved by G. Griinwald in case card S= Xo. For the literature of 
the problem see, for example, P. Erdés, Some set-theoretical properties of graphs, 
Universidad Nacional de Tucuman Revista, 1942, also footnote 7. 
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are called independent if and b¢ f(a). A subset S’ of S is 
said to be independent if any two elements of it are independent 
or if S’(\f(S’) is empty. Turdn’s question was: does there always 
exist an infinite independent set? G. Griinwald showed that the answer 
is affirmative. Later Laz4ér showed that there exists an independent 
set of power c. Sierpinski and Ruzievicz raised the following general 
question: Let S be a set of power m, and let »<m, an arbitrary 
cardinal number. To every aC 5S there corresponds a subset f(a) of S 
satisfying a€ f(a) and f(a) <n. Does there always exist an independent 
set S’ with card S’=m? This has been proved if m is a regular number, 
or if m is the countable sum of smaller cardinals.? We shall prove, 
assuming the generalized continuum hypothesis, that the answer is 
always affirmative (Theorem V). 

If we replace the condition card (f(a))<n<m by card (f(a)) <m, 
then in general we do not even have two independent points. Let 
Q be the initial number belonging to m, and a; the set of ordinals 
less than Q;. Define f(a;) as the collection of all 8 <a;. Clearly no two 
elements are independent. 

First I give a very simple proof of Theorem V in case m is a regular 
number. This proof is due to D. Laz4r, and has been communicated 
to me orally. Assume that the theorem is false, that is, the power of 
every independent set is less than m. Let S, be a complete set of inde- 
pendent elements, that is, S,/\f(S:) is empty and if a@S,, then 
(@US))Mf(aU S;) is not empty. Consider S—S,—f(S;), and let S: be 
a maximal independent subset of it, and consider S—.S,—S.—f(S:) 
—f(S2). Continue this process for all ordinals a<Q;, where Q, is 
the initial number belonging to m=N,. Since m is a regular 
number, the power of U.(S.Uf(S.2)) is less than m. Thus the 
set S—U.(S.Uf(Sa)) is not empty. Let a be an element of this set. 
Because of the maximal property of the sets S., f(a) must intersect 
each of the S,’s (by our construction a¢f(S.), thus the last state- 
ment is a consequence of the fact that a and S, are not independent). 
But this would mean that card (f(a))2m, an evident contradiction. 
Clearly this proof also works if m is singular but is not the sum of n 
smaller cardinal numbers. 

Now we prove our main theorem. Let card S=m, m singular. Put 
S=U.Sa, a<Q,. Card (S.)=ma, ma>ms for a>B. Let be the 
initial number belonging to m and Q, the smallest ordinal number co- 
final to 2. We can clearly assume that each m, is regular (every 
singular number is the sum of fewer, smaller regular numbers). Also 


7 Sophie Piccard, Fund. Math. vol. 29, pp. 5-9. See also Comptes Rendus des 
Séances de la Société des Sciences et des Léttres de Varsovie vol. 30 (1937). 
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we can assume that m, > n* and m,>card (Q,). By the theorem proved 
in the previous pages, there exists for every S.aset S/ CS,, such that 
Sa\f(S2) is empty and card (SJ) =m,. Omit for 8>a all the ele- 
ments of f(S/) from Sj. Thus we get the sets SZ’ of power ma, such 
that 


(7) U s;') = 0 (0 stands for the empty set). 
Bea 


We want to construct sets SZ”. SY of power m, which further satisfy 
(8) U = 0. 
<a 


But then clearly 


(9) (u sz’) = 0. 


Thus the set U.S,” is independent and clearly of power m. Thus we 
only have to construct S$”. We shall use transfinite induction. Let as 
before Q; be the initial number belonging to m and let N be any set 
with card N2n*-. We construct a ramification system belonging to 
N as follows: Consider the disjoint sets 


(10) 
N= U_  Ny”.; card (Ni,”.) = card N. 
Clearly there are n*»< N such sets, thus since N?=N this is clearly 
possible. The sets (10) will be the Q,th (and last) column of the 
ramification system. The sets of the kth, 1=k<Q,, column we de- 


fine as follows: (41, - again run through the ordinals 
less than Q; and j7<k) 


(11) 


this means that N?... is the union of all the sets of (10) whose indices 
agree with it for 7<k. (The Oth column N™ is N.) We will denote 
this ramification system by R(N). 

Consider now R(Sj’ ). Let a be any element of SJ’. Since card (f(a)) 
<n there must exist an (S{’)® (that is, a set of the first column of 
R(Si')) such that f(a)OS{’ )@=0. Now since mz (card Sj’ is 
regular (thus not the sum of m sets of power <mz,) there exists a set 
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S}? CS?’ with card S} =m, and an index i{} < Q; so that 


Consider now R(S}'). Let a be any element of Sj’. As before there 
exists a set S3’/C.Sj’ with card Sj! =m; and 
rn, (2) 


HSY) AO = 0, (SY) Em = 0, 


where iq”, 7$”, i are less than Q;. Again we construct R(S;), and so 


on. Assume we have already completed this construction for all 
ordinals B<a< Q,. This means that there exists for all ordinals 
<a sets SY, with card S!’=m, and so that for all 5<y we have 
k ” 
where i$” - - - are ordinals less than Q; and k=1+(y—5). Now we 
construct the ath step as follows: Let a be any element of S2’. Since 
card (f(a)) <n, there exists for every 8 <a an index i < Q; such that 
rr 


SQ COV = 0, = 1+ (a — 8). 
The number of possible choices for all the i, 8<a, does not exceed 
n® < Ma, 
since the generalized continuum hypothesis was assumed to be true 
(it is clear from the generalized continuum hypothesis that a? 
<= max (at, b+) where at is the cardinal number immediately follow- 


ing a). Thus since m, is regular there exists a set S,) of power ma 
such that for all B<a 


rn (k) 


CV = 0, k=1+(a— 8). 


Now we construct our ramification system R(S2’). We continue this 
construction for every ordinal 5< Q,. Thus we obtain the sets 


(Q,) 


(12) (Ss)i,@,... = D;, <a. 


The sets (12) all occur in the last (that is, 2,th) column of their 
ramification system. By construction they satisfy 


f(Ds,) CO Ds, = 9 for d2 < 6; < Q,. 


Thus we can put 


Ds = Se 


[April 
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and this completes the proof of Theorem V. 

In the first paper on this subject Lazar* proved (without assuming 
the continuum hypothesis) that Theorem V holds if f(a) is finite and 
card S=c. In fact his proof gives that S is the union of Ny independ- 
ent sets. One can ask the question what happens in the general case? 
It is easy to see that if f(a) is finiteand has »ot more than & elements 
and S is also finite, then S is the union of 2k+-1 independent sets, and 
2k+1 is best possible. By a method of K@Gnig® it is easy to see that 
this also holds if S is countable. I conjectured that it is true for all 
sets S. Clearly this would be a consequence of the following result: 
Let G be a graph. Assume that all finite subgraphs of G are the union 
of r independent sets. (Two points are independent if they are not 
connected. A set of vertices is independent if any two of them are 
independent, and a graph G’ is said to be the sum of r independent 
sets if the vertices of the graph can be split into r independent sets.) 
Then G is the union of r independent sets. De Bruijn recently proved 
this conjecture (written communication). In general perhaps the 
following result holds: Let card (f(a))<m. Then S is the sum of n 
independent sets (# is an infinite cardinal number). 

Let S be any set of real numbers. We define S; to be the set of all 
numbers 


k 
¢, rational, a, € S. 
r=1 


Let H be a Hamel! basis. It has been remarked” that the sets H; can- 
not all be measurable. Indeed since U;H; is the set of all real numbers, 
they cannot all have measure 0. Suppose that A; has nonzero meas- 
ure. If the inner measure of H; would be positive, then by a well 
known theorem of Steinhaus'! there would exist a 6 such that if 
x| <5, x=a—b, aC Hi, H,. But then contains the set of all 
real numbers, which is clearly impossible. 

Sierpinski" proved that there exist Hamel bases which are non- 
measurable and also Hamel bases of measure 0. We shall prove that 
for every k there exists a Hamel basis such that H; has measure 0 but 
is nonmeasurable. 

Let 0S Ai <A2< - - - be an infinite sequence of integers, such that 
every integer is the sum of k+1 A’s, and the number of A’s not ex- 

8 Compositio Math. vol. 3 (1936) p. 304. 

® Theorie der endlichen und unendlichen Graphen, pp. 81-85. 

10 This remark is probably due to Sierpinski, but I do not remember for sure. 


1 Fund. Math. vol. 1, pp. 93-104. 
2 Tbid. vol. 1, pp. 105-111. 
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ceeding m is less than cn'/*+!, We shall prove the existence of such 
sequences later. Denote by B the set of real numbers in (0, 1), which 
admit the representation: 


o A; 
>— with A; < j an arbitrary term of { A,}. 

First we prove that B; has measure 0. It clearly suffices to prove that 


>"*_, cb, where the c’s are fixed integers and b,€ B has measure 0. 
Let xCB, be arbitrary. We have 


k @ A;(r 
s=> > 0S Ai,(r) <j, 


where | u;| <c:-j and we have, for u; less than c’, j*/*+! choices. Con- 
sider all the numbers 


The number of these numbers is less than c’"(m!)*/*+!, Consider all the 
intervals whose centers are all these points and whose length equals 
4c2/n!. It follows from | <j that all the are in the 
interior of these intervals, that is, the whole set > -*_,c,b,, b,CB, is 
covered by them. The sum of the length of these intervals is 
<4c(c'"/n!/*+1) =0(1), which proves that B, has measure 0. 

Now we construct a Hamel basis HCB, with Hi,; nonmeasurable. 
First we construct a set LCB such that L is rationally independent 
(that is, any finite subset of L is rationally independent) and Ly,, is 
nonmeasurable. Let M=A,+A2+ - - - +A. It is well known (and 
easy to see) that the real numbers z, 0<z<1, in whose representa- 
tion z= > ,u;/j!, OSuj;<j, we have infinitely often u;=M, have 
measure 1. Denote this set of real numbers by I. Let { Fa}, a<Q,, 
be the collection of all the perfect subsets of Jy (Q, is the initial 
number of c). Suppose we have already constructed a set L™ of 
power less than c, which is rationally independent and such that 
L®, intersects all the Fz with a<f. Let 2& Fs, z@U;L™. Such a z 
exists since the power of U;L® is less than c. Consider the equation 


u; 
2= Xt + ten, 4 CB, > 


j=2 


k=2 k! | 
(+) 
0S 4;, 1; < 2. 
ji! | 
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This equation is solvable. Also clearly we have c choices for x, ¢ 
choices for x2 after we have chosen x, and so on. Finally we have ¢ 
choices for x, after we have chosen x1, %2, - - - , The proof of 
these statements is immediate, since uj= ) 3*} A® is solvable and 
since, in the representation of z, M=Ai+ -- - occurs in- 
finitely often, say u;,=uj;,= --- =M. We can clearly interchange 
the jith, joth, - - - digits of the x’s (those digits are Ai, As, - - - , Axys) 
and thus obtain the required c solutions. Thus we can clearly choose 
xi, 1k, so that the set consisting of L™, the x;,i<k, and z should be 
rationally independent. But then the set consisting of L™ and the 
xi, 1Sk+1, is also rationally independent. This set we denote by 
L®+», Clearly L&*” intersects Fs, and the power of L&* is less than 
c. Put now L=Usco,L™. Clearly Li: intersects all perfect subsets of 
Iy, thus can not have measure 0 (I has measure 1), but since L is 
rationally independent, Zi4; can not have positive inner measure (as 
was shown before). Thus Zz4; is nonmeasurable. Now since By; 
clearly consists of all real numbers, we can find a Hamel basis HCB 
which contains L. Clearly H;+; is also nonmeasurable and H;C B,; has 
measure 0, which completes the proof. 

We now have to construct our sequence A_ with the required 
properties: Denote by T,, 0Sr<k+1, the sequence of integers of the 
form €;2/, n=1, 2, - --, or 1, and all the are 0 except 
possibly for those with j7=r (mod (k+1)). Consider now the sequence 
U,T,, this sequence clearly has the required properties." 

Let S be a set of power greater than Np but smaller then the first 
strongly inaccessible cardinal number (>No). It is well known that 
no countably additive two-valued measure can exist for the subsets 
of S, so that elements have measure 0 and the whole set S has meas- 
ure 1. Ulam now raised the question (oral communication): What is 
the smallest cardinal number m so that there should exist ” two- 
valued measures defined for the subsets of S (elements of S having 
measure 0 and S having measure 1, in each of them) with the prop- 
erty that each subset of S is measurable in at least one of these 
measures? Ulam proved that n2No. Alaoglu and I proved m2N,. I 
now present our proof. 

Suppose the result is false. Then there exist countably many meas- 
ures Mi, Ms, - - - so that each subset of S is measurable for at least 
one of these measures. Split the set S into the union of N sets each 
nonmeasurable in M;.“ Clearly only countably many can have posi- 
tive measure in any M;, thus there must exist two of them, S; and 


18 This example is due to Stéhr, Math. Zeit. vol. 42, pp. 739-743. 
4S. M. Ulam, Fund. Math. vol. 16 (1930) p. 142. 
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S{ say, which are both nonmeasurable in M, and whose union does 
not have positive measure in any M,. Split now S—S,—S{ into the 
union of &, disjoint sets each nonmeasurable in M2. Denote these 
sets by U;, U2, Us, It is easy to see that among the sets 
Si: +S{+ U; there are only countably many of positive measure in any 
M,. Thus there exist two of them, S, and Sj say, such that S,+.S{+5S; 
+.S} does not have positive measure for any M;,. Consider now the 
set S—S,—S{—S.—S}, and repeat the same operation. We can clearly 
repeat this operation for any k, and the set U;S; will be nonmeasur- 
able for any M;, since it contains S; but is disjoint to Sj. We cannot 
decide the question whether »>§N,. It seems probable that n is 
greater than the power of S. 

Some simple remarks about ordinal numbers: (1) A sequence of 
ordinals (not necessarily countable) 8;< Q is said to be rarified if 
lim Bi41—B8i = Q:, also every bounded sequence is said to be rarified 
(that is, a sequence for which lim 6;= Q,). Sierpinski remarked that 
the ordinals a< Q, are the sum of N, but not of a finite number of 
rarified sequences. Assume that 2 has an immediate predecessor. 
Then the ordinals a< Q are the sum of Q,_; but not fewer rarified 
sequences. If 2; does not have an immediate predecessor and , is 
the smallest ordinal cofinal with it, then the ordinals a< Q; are the 
sum of 2 but not fewer rarified sequences. If 2; is weakly inaccessible, 
that is, 2; = Q;, then the ordinals a< Q; are not the sum of fewer than 
Q, rarified sequences, or the weakly inaccessible numbers are the 
only ones which are not the sum of fewer rarified sequences. 

Similarly we can define a sequence 8;<{ to be r-rarified if 
lim Bi41—8; = Q,. Then the ordinals a< Q are the sum of min (Q,, Q;) 
but not fewer r-rarified sequences. 

(2) Dushnik" proved the following theorem: Let m be a regular 
number, Q the initial number belonging to it. Let f(a), a< %, be 
such that for all a, f(a) <a. Then there exists an ordinal 8 such that 
the equation f(a)=6 has m solutions. For singular numbers the 
theorem is false. The following generalization holds: Assume that 
is not cofinal to w. Then there exists an ordinal 8 and a sequence a; 
cofinal with Q such that f(a,;) <8. For regular numbers one im- 
mediately obtains from this Dushnik’s theorem. If Q, is cofinal to 
w, the analogous result clearly does not hold. Dushnik’s proof would 
easily give a proof of this theorem (oral communication) but perhaps 
the following proof is of some interest: Suppose the theorem is false. 
Then to every 6 there exists a ¢(8) < Q such that for 62¢4(8), f(6) >8. 


18 Revista de Sciencias (Lima) vol. 41 (1939) pp. 289-296. 
#6 Bull. Amer. Math. Soc. vol. 37 (1931) pp. 860-862. 
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Consider now 
y = o(1) + o(4(1)) +---. 


Clearly 7 < Q; since Q, is not cofinal with w. But then from the defini- 
tion of y, f(y) 27, an evident contradiction; this completes the proof. 


UNIVERSITY OF MICHIGAN 


ON A PROBLEM OF G. BIRKHOFF 
TADASI NAKAYAMA AND JUNJI HASHIMOTO 


In his book Lattice theory, G. Birkhoff proposed to prove that the 
representation of a finite partially ordered system as the product of 
indecomposable factors is unique within pairwise isomorphism of 
factors.! The present short note is to show that this is not the case in 
general. A simple counterexample, and indeed one of the simplest, 
perhaps, can be constructed as follows: 

Let X be the lattice {0, 1 } of two elements 0, 1 (0 <1), for instance, 
and A be the partially ordered system 


I+ X + X4+ X5, 


where J resp. X‘ stands for the one-lattice resp. the direct product of 
4 copies of X, and where + means direct summation. The finite 
partially ordered system A may be expressed also by f(X) with the 
polynomial f(x) =1+x*+x?+2°+2'+<5. Since every X‘ has the up to 
isomorphism unique decomposition into indecomposable factors, 
X‘=XX - - - X (i factors), one sees easily that direct decompositions 
of A are, in the sense of isomorphism, in 1-1 correspondence with 
factorizations of polynomial f(x) =1+x+x?+25+x'+25 into factors 
with non-negative rational integral coefficients. But our f(x) has two 
distinct decompositions into factors which are irreducible in the pre- 
scribed sense, namely 


f(x) = (1+ 2° + = (14+ 2’). 
Two direct decompositions 


A=(T+X)\I+ X*+X9= X + 


Received by the editors October 28, 1948. 
1 G. Birkhoff, Lattice theory, Amer. Math. Soc. Colloquium Publications, vol. 25, 
New York, 1940, Problem 8. 
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of A give thus a counter-example to the conjecture. 

Nevertheless the conjecture is true in the important special case of 
directly sum-indecomposable systems, as will be shown in a forth- 
coming paper by one of the writers.? Making use of the result we 
may formulate a criterion of unique decomposition as follows: First 
we decompose the given system into the direct sum of sum-inde- 
composable systems, then decompose the so-obtained summands into 
a product of (product-)indecomposable factors, construct a poly- 
nomial by taking a variable for each isomorphism type of such 
(product-)indecomposable factors, and finally examine whether the 
factorization of the polynomial (with several variables, perhaps) into 
factors irreducible within the domain of polynomials with non-nega- 
tive rational integral coefficients is unique.* 

We want also to suggest a generalization of the notion of partially 
ordered system by allowing negative coefficients, as in the case of 
algebraic complex in topology. This change of notion will make Birk- 
hoff’s conjecture true. 

Added in proof: Our above example, communicated to Professor 
Birkhoff, has been incorporated into the revised second edition of 
Lattice theory, p. 26. 


NacGoya UNIVERSITY AND 
Kose TECHNICAL COLLEGE 


2 J. Hashimoto, On the product decomposition of partially ordered set, Mat. Japonicae 
vol. 1 (1949). 


§ This clarifies two extreme cases cited by Birkhoff, loc. cit. 
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THE DIFFERENCE BETWEEN CONSECUTIVE 
PRIME NUMBERS. IV 


R. A. RANKIN 


1. Introduction. Let », denote the mth prime, and let © be the 
lower bound of all positive numbers go such that no Dirichlet L-func- 
tion L(s, x) has a zero at s=o+it, where ¢>oo. Write 


The purpose of this paper is to combine the methods used in two 
earlier papers' in order to prove the following theorem. 


THEOREM. 
(1) lS c(1 + 40)/5, 
where c<42/43. 


The precise definition of ¢ is given by (18) and (8) below. 
The theorem is an improvement on the result /<(1+40)/5 ob- 
tained in II. It was shown in III that 


(2) 1<d=0.966--- < 57/59. 
The numbers ¢ and d are connected by the relation 
1—d=(3+ 


so that (1) is an improvement on (2) only if © is not too close to 
unity, in fact, if 0<0.986 - - - . In particular, if the “grand Riemann 
hypothesis” is true, that is, if @=1/2, we have (using the value of ¢ 
given by (18)) 


3c/5 < 109/186 = 0.58602 - - - 


2. Notation. Since (2) is sharper than (1) for @ =1, we shall assume 
that 0 <1, and write 


(3) o = (1+ 40)/5. 
Let N be a large positive integer, and define 
(4) X = N*(log N)-80+» 


Received by the editors January 4, 1949. 
1 See [2] and [3]; I refer to these as II and III respectively. Numbers in brackets 
refer to the references cited at the end of the paper. 


143 


| 

| 
log Pn 

| 

| 

| 


144 R. A. RANKIN 


{April 


as done in equation (6.7) of II; here y is a positive constant which 
may be taken to be 2. 


Let @(v) be the number of prime pairs p, p’ which satisfy 
N<? S78 2N, p—?p' =», 


where v is an integer. We suppose that h, k, m, n and r are integers 
such that, for some positive Ai, Az independent of N, 


(5) A, log N < 2k/3 <h<k< AzlogN, 
(6) OSm<n<Azlog N, 
r=k—h. 


The letters a, c, a, 8 denote positive numbers independent of N, 
and we write 


1 
7) D= (1 + 
and take 
(8) A = 7.3566 > e?7A,/2, 


where ¥ is Euler’s constant and A» is Buchstab’s number 46.67347 
(see relation (4) of III). 


For u=1,2,---, k we define a, as follows: 
a 
(9) 
B (r < > k— r), 
and write 
k 
Then 
k-1 
(11) | |? = £(0) + 2 cos 4x08, 
where 


= 


jSiSk 
3. Lemmas. 


Lemma 1. We have 


| 
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2(r — v)a? + 2vaB + (k — 2r — 
2raB + (k — 2r — v)p? (r Sv < k — 2r), 
=) — b+ va? + Uk — — 
(k — v)a? (k-—rsv< 
and 
1 
= { 2ra + (k — 2r)B}? + O(log N). 


This follows easily from (9) and (11), and we omit the proof. It is 
in this lemma that the restriction k—r=h>2k/3 made in (5) is 
used. 


LEMMA 2. 


$7(Q) 


eAtivfia 


2 
=— —— } + O(vX—(log log X)?). 
; (vX-*(log log X)?) 


Here (gq) is the Mobius function, ¢(q) is Euler’s function, and D is 
defined by (7). 


It follows from the well known relation 


q-1 
= dy (4) 


f=0,(/,q)=1 a| (2%,¢) 
that 
u?(q) 


$°(9) (2.4) 


a>X pi 


By a straightforward reduction the first term on the right may be 
shown to equal 


2 
D Li 2 
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See, for example, §§3.21-3.23 of [1]. Since 1/¢(m) =O(n-' log log n), 
the second term on the right is 


04 x-*(log log X)? [] (p = O{»X—(log log X)?}. 


p\2 


Lemma 3. If s20 and is independent of N, then 


n—1 1 n—1 
TI (=) = D >> + Of (log N)*(log log N)*}. 


y=m ply, p>2 =m 


The proof is similar to that of the lemma in III. We suppose that d 


is any product );,);, - - - pi; of distinct odd primes, and let d’ be the 
associated product 


= (pi, — 2)(Di, — 2) (Di; — 2). 


We permit d to take the value unity and then put d’=1. Then, 
clearly, 


1 1 


d<n dd’ d21 dd’ 


+ (log log 
n 


1 
D+ of — (log log an) 
n 


since 1/d’ =O{d-"(log log d)?} for large? d; and 


M 
| 


For a given d’, v is a multiple of d, say y=yd, and, since m Sv <n, pu 
takes all integer values in the interval m/d sy<n/d, which we denote 
by I(d). Clearly 


o+1 


n m 
4d)? = O *), 
tO 
and \ 
net) — 
O 
— 


2 It is possible to dispense with the factor (log log 3n)*. 
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Hence 
n—1 
vam ply.p>2 p —2 (a) 
get? 1 1 
O 8 
s+1 Lat 5) 


1 
+0 (— (log log 
n 


+ o{ n*(log log 3)? log 2n} 


n—1 


D O{n*(log 2n)*}, 


ll 


and the lemma follows from (6). 
Lemma 4. 


k—1 


2 >> + Of log N(log log N)?} 


{ 2ra + (k — 2r)B}? + Oflog N(log log N)?}. 


By Lemma 2, the left-hand member is equal to 


p\y,p>2 


It follows from Lemmas 1 and 3 (with s=0, 1) that this equals 


k—1 
&(v) + Oflog N(log log N)?} + O{ X-1(log N)*(log log X)*}, 


v=1 
and the lemma follows from (4) and the second part of Lemma 1. 


4. Proof of the theorem. By considering the expression 


I(N, k) = f | (6) 


where 


= log p (p prime), 


N<pa2Nn 


| 
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and (6) is defined by (10), it is possible to show that, for large N, 
I(N, k) = &(0)N log N{1 + o(1)} 


k—1 
+ 2{1 + o(1)} log? NV >> &(v)a(2») 


2 
smo,(7,0)—1 
(12) + o(N log? NV) 


$(q) 


1Sqsx $7(Q) 


q-1 k—1 


+ o(N log? N). 


f=0,(/,q)=1 


The proof of this is similar in every respect to the proof of (4.1) 
and (6.8) of II, except that W(&) is defined by (10) and is not 
>rrie?* as in II. The restriction k<log N made in II is unessential 
and can be replaced by k< Az log N. 


It follows from (3), (4), (12), Lemma 4, and Lemma 4 of? II that 


k—-1 
2{1 + 0(1)} log? 
> 4N{1 + 0(1)} >> &(») — oN log N{1 + o(1) }£(0) 
ven] 
(13) + o(N log? N) 
= 2N{2ra + (k — 2r)B}*{1 + o(1)} 
— oN log N{2ra? + (k — 2r)p*} {1 + o0(1)} 
+ o(N log? N). 


Now, just as in III, it can be shown by Buchstab’s form of Brun’s 
method‘ that, for sufficiently large N, 


’ There is a misprint in the enunciation of the lemma which should read: “If X 
is large Doisesx X.” 

‘ We must consider the interval (N, 2.) instead of the interval (x/log x, x). Our 
only reason for considering an interval of the form (N, 2N) is that it enables us to 
quote results directly from II. As far as Brun’s method is concerned, the inequality 
stated remains true if we replace the left-hand side 4(2v) by the number of prime pairs 
b’ which satisfy p—p’ =2»>0 where A 20. 


| 
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—1 
@(2r) < ———__ J] 
D(log N)* \P — 2 


where X is defined by (8). It follows from Lemmas 1 and 3 that 


k-1 4\N 
< —— &(v) + (log log 


lo N 2 
(14) 
of (log 1 
= 
(log N)? log N aia 


since r=O(log N). 
Accordingly, if we make the assumption that @(2v)=0 for all 
v<h=k-—r we have, from (13) and (14), that 


2ra + (k — 2 3 
{ {1+ o(1)} 
log N 


1 2ra? k — 2r)p? 
(15) of ro? + ( r)B 
log V 


2 


r 2 
< ( 
log N 


We now take a=1, 8=2"?, and put 


= |— og N |, 

1 


(16) 


where a is a fixed positive number satisfying 
(17) a<3— 23/2, 


It is easily verified that, with this choice of a, h>2k/3. 
Then, for large N, 


= (24? + 1) log + 0(1), 
1 
+ (kt — 2r)B = (2 — + = log N + (1), 


2ra® + (k — 2r)6* = 2h= o(1 log N + O(1). 


| 
| 
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It follows from (15) that 
1 1 a oa(2'/? + 1)) 2 
2 2 r 2r 


a= 3 — 23/3, 


that is, that 


which contradicts (17). 
We conclude, accordingly, that @(2v) >0 for some integer » satis- 


fying 
1 a 
o(1 log N. 
2 
It follows from this inequality, and from (17), that 
lS oc 
where 


42 
(18) c= 1— (3 = 0.97667 


and this proves the theorem. 

Note added 24 April 1949. Professor Erdés has informed me that 
considerable improvements have recently been made in Brun’s 
method by A. Selberg and others; this will presumably mean that 
Buchstab’s constant Ao, and consequently my constants ¢ and d 
(see equations (2) and (18)), can be replaced by smaller values. 
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FAMILIES OF SOLUTIONS OF THE UNILATERAL 
MATRIX EQUATION 


JAMES H. BELL 


1. Introduction and summary. Let 7 be a field of characteristic 
zero, and let R,, (m=0, 1, - - - , r) be Xm matrices with elements in 
F. M. H. Ingraham [1]! has shown that the unilateral matrix equa- 
tion >"%,-oR.X"=0 may have an infinitude of solutions. In this 
paper a study is made of the conditions under which such an infini- 
tude of solutions will exist. All matrices considered in this paper will 
be square and of order m unless otherwise specified. 

The matrix X, is similar to the matrix X;2 if there exists a non- 
singular matrix T such that 7X,7-'!=X; and the elements of Xi, X3 
and T are in 7. The polynomial domain Fh] of F is a principal ideal 
ring. The matrix ) aE is a A-matrix, that is, its elements are in 
¥ [A]. A square \-matrix is unimodular if it has an inverse which is 
also a \-matrix. Two A-matrices A and B are left associates if there is 
a unimodular matrix U such that UA=B. The relationship of Jeft 
associativity is an equals relationship. Every square \-matrix is a left 
associate of a unique matrix which is in canonical triangular form. 
This matrix has all elements below the main diagonal equal to zero. 
If a diagonal element is zero, its row consists entirely of zeros. The 
leading coefficient of each nonzero diagonal element is unity, and the 
elements in that column are reduced modulo the diagonal element.? 

If M, P, and A are \-matrices such that M= PA, then A is said to 
be a right divisor of M, and M is a left multiple of A. Every pair of 
A-matrices has a greatest common right divisor (g.c.r.d.) D and every 
pair of nonsingular \-matrices has a least common left multiple 
(1.c.l.m.) M which is unique up to left associates [2]. 

The question arises: Given two nonsingular matrices A and B, 
what other right divisors of their l.c.l.m. M are there which are not 
left associates of A or B? It has been shown’ that Xj is a solution 


Presented to the Society, September 7, 1948; received by the editors January 21, 
1949. 

1 Numbers in brackets refer to the bibliography at the end of the paper. 

2 For these definitions and terms see MacDuffee [2]. The Hermite normal form 
as defined by MacDuffee is the form generally used. The canonical triangular form 
defined here is used in keeping with that used by Ingraham [1]. It is obtained in the 
same manner as the Hermite normal form except the operations are carried out in 
the columns in reverse order. 

* The proof of the factor theorem for polynomials with coefficients in a ring with 
an identity element is given by Albert [3]. 


151 


152 J. H. BELL [April 


of Sinno RnX"=0 if and only if AJ—X; is a right divisor of 
> R,,A”. Therefore, if there exist two distinct solutions X; and 
Xz, the l.c.l.m. of Xi and AI —X2is a right divisor of 
If this l.c.l.m. has a right divisor of the form AJ — X;, where X; is dif- 
ferent from X, and X2, then X; is another solution of the unilateral 
equation. Thus, a study of the above problem yields results ap- 
plicable to the unilateral matrix equation. The A-matrix A is a right 
divisor of the A-matrix M if and only if its canonical triangular form is 
a right divisor of the canonical triangular form of M. Consequently, 
the matrices A, B, and M will be considered as being in canonical 
triangular form. 

If the A-matrix R(A) has a set of right divisors whose canonical 
triangular forms have the same main diagonal, then the set will be 
called a family of right divisors of R(A). A matrix of the form AJ—X 
where X has elements in 7 will be called a monic matrix of degree one. 
It is easily shown that two monic matrices are left associates if and 
only if they are identical. If a family of right divisors of R(\) are the 
left associates of monic matrices of degree one, the matrices X ob- 
tained from the monic matrices are solutions of R(X)=0 and will 
be called a family of solutions of R(X) =0. 

It is shown that a necessary and sufficient condition for the 
existence of an infinite family of solutions of the unilateral matrix 
equation is that there be two distinct solutions which are similar. All 
matrices similar to a given solution are solutions of the equation if 
and only if their minimal polynomial is a common divisor of the ele- 
ments of >-%,-oRmA™. An interesting result arises as a by-product of 
this second result, namely: The equation 
XRu=0), where R,,- XX™ is the direct product of Rn 
and X™ [2], has a solution if and only if the elements of >~7,-oRn»A”™ 
have a greatest common divisor which is not a unit. (In this problem 
the order of X is not restricted to n.) It is also shown that if 
has a solution, and if the determinant of 
is zero identically, then there is an infinitude of solutions of the 
equation. 


2. Families of solutions. The following theorem provides a basis for 
the results obtained in this paper. 


THEOREM 1. If P=QA, where P is a nonsingular \-matrix, P and A 
being in canonical triangular form, and tf for some values j and k, where 
j>k, it is true that qu, and a;; are not relatively prime; then there exists a 
family of right divisors of P which are not left associates. Furthermore, 
if A is the left associate of a monic matrix of degree one, then the mem- 
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bers of the family, with a finite number of exceptions, are also left asso- 
ciates of monic matrices of degree one. 


Since P and A are triangular matrices and P is nonsingular, it fol- 
lows that Q and A are nonsingular, and Q is also triangular. Hence, 
Pum (mM=1,2,---, 2). 

Consider a possible factorization of P into P=BA’, where A’ has 
the same diagonal elements as A and is in canonical triangular form. 
One such factorization exists, namely P=QA. The matrix B will bea 
triangular matrix. 

Represent the greatest common divisor of two polynomials f(A) 
and g(A) by the symbol (f(A), g(A)). 

Select the values of k and j so that: (qu, a;;) #1 and 7>k; (qmm, 2pp) 
=1 when m<p and p>j; (qmm, @;;)=1 for m<k. Also let 


(m, p=1, 2, p=j, m=k-+1, k+2, »j—1). Since 
bam =Qunm (m=1, 2,---,n) and bap=gup (m, p=1, 2,---,j—1; 
p=j, m=k+1, k+2,---,j-—1), the element aj, is obtained from 


the following equation: 


m=k+1 
that is, 
i-1 
m=k+1 

This congruence has a solution, namely a;;. Therefore (qi, a;;) 
divides and ay =x1(A) + ( fi(A) 
where x:(A) and f;(A) are uniquely determined, p is the degree of 
@;;) and (m=1, 2, -- +, p) are arbitrary parameters. The re- 
maining elements of A’ are uniquely determined in terms of aj, and 
the other elements previously found, since (dmm, @;;)=1 (m<k), and 
2pp) =1 (p>j, p>m). 

Therefore, in solving for A’ an infinite family of right divisors of P 
is obtained. Furthermore, each member of the family is in canonical 
triangular form. Hence, no two distinct members of the family may 
be left associates. 

The existence of a family of right divisors of P has been shown. In 
the construction above the parameters first occur in aj, and may occur 
in the elements a), (m<k, p=j; p=j+1, - - - , m). When the param- 
eters occur in an element, they will appear in the coefficients of the 
various powers of A in that element. In this case the coefficients will 
be polynomials in the parameters. A certain combination of values 
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for the parameters will yield A from A’. Assign these values to all 
but one of the parameters. The modified matrix A’ will involve only 
one parameter, and by assigning the proper value to this parameter A 
w?'l be obtained from A’. 

If there is a unimodular matrix T such that TA =I —X, then it is 
true that A’ will be the left associate of a matrix \J—X’ except 
possibly for a finite set of values of the parameter. The necessary 
and sufficient conditions that the canonical triangular matrix 
A= > s,ncA wd" be the left associate of a monic matrix of degree one 
are: The degree of []%-~,¢mm should equal , the degree of [[4,-:dmm 
should be equal to or less than 7 (j<m), and the matrix W; 
=||A, -Ail|shall be of rank n. Since - - -,n) 
the first conditions are satisfied. The matrix W; consists of at most 
nonzero columns.‘ That is, the rank of W, is the rank of an Xn 
matrix D formed from the nonzero columns of W;. Because | D] is a 
polynomial in the parameter and | D| £0 (since TA =AI —X), it fol- 
lows that | D| =0 for only a finite set of values of the parameter. 
Therefore there is an infinite family of right divisors of P which are 
left associates of monic matrices of degree one. 

The following theorem is due to V. J. Varineau [4]. 


THEOREM 2. If the least common left multiple of two nonsingular 
matrices A and B, with elements in a principal ideal ring, is M, and 
their greatest common right divisor is D; then | M|\ -|D| =u|A|-|B| 
where u is a unit. 


THEOREM 3. The unilateral matrix equation >-7,-oR»X"=0 has an 
infinite family of solutions if and only if there exist two distinct solutions 
X, and X_ which are similar. 


Let A, B, and M be the canonical triangular forms of \J—X,, 
AI and the l.c.l.m. of and AI—X;z respectively. Since 
Xi and X; are solutions of >-%.>R,X"=0, A and B are right 
divisors of }°y,-oRmA”. Therefore M is a right divisor of >~%,->R»d”. 
It will be shown that M has a family of right divisors of the form 
\J—X’, and thus DraoknX"=0 has a family of solutions. 

Since |A| =|B| =|\AJ—X,| =|\AJ—X;|, M is nonsingular. Also 
M=PA=QB, where P and Q are triangular \-matrices. Either 
= Omm (m=1,2, - - , 2) orelse not all 
If (m=1, 2,---, m) it follows that (px, a;;)~1 for some 
values j and k where j>k. If this were not so, there would be a 


* Published as note without proof in Ingraham [1]. Proof given in a paper by 
Bell [5]. 
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unique right divisor of M having the elements a, as the main 
diagonal elements, whereas there are at least two such divisors, 
namely A and B. The theorem follows from Theorem 1. 

Assume, in the second case, that not every dn» is equal to the 
corresponding Dm. Let 7 be the least value of m such that a;;#bj;;. 
Then a;;=f(A)h() and where (f(A), g(A))=1. Since 
M5; = = it follows that g(A) divides and f(A) divides q,;;. 
Because | |[i<4am= [[k-um, it follows that some factor of g(a) 
divides some element a,, (P>j), and some factor of f(A) divides some 
element Dy, (k >j). That is, #1 (R>j) and (ay, p;;)¥1 (p>). 
By Theorem 1, there exists a family of solutions whose corresponding 
canonical triangular forms have the same main diagonal as A, and 
another family, whose corresponding canonical triangular forms have 
the same main diagonal as B. 

If there is a family of solutions, let X, be one of the solutions. All 
members of the family will satisfy the scalar equation |~Al—X,| =0. 
A scalar equation of finite degree may have only a finite number of 
dissimilar solutions [2]. Therefore, there must exist at least two 
similar solutions. Furthermore, there must be at least one class of 
similar solutions which is infinite in number. This completes the proof 
of Theorem 3. 


3. The solution of >°7,.oRm- XX"=0. If Xi is a solution of the 
unilateral equation and X;: is a matrix similar to Xi, it does not fol- 
low that X2 is a solution. Since X2.=SX,S—'!, X2 will be a solution if 
and only if >-%,-oR»SX"=0. A nonzero, but not necessarily non- 
singular, solution for S will exist if and only if | ),-oRn- X(X7)™| 
=0 (where A- XB=(Ab;,;) and X7 is the transpose of X) [2]. The 
form of the matrix involved in this determinant creates an interesting 
problem namely: When does the equation > XX"=0 have a 
solution? It should be noticed that X need no longer be of order n. 


THEOREM 4. The equation where Rn 
(m=0,1,---,7) and X have elements in F, has a solution if and only 
if the elements of >-‘,-oRmd™ have a greatest common divisor which is 
not a unit. 


Consider the matrix where Rn=(r9) and 
m = (xi). By simply rearranging its elements this matrix may be 
changed into the matrix If then 
X")=0 and conversely. Therefore, > and 
the solution X must satisfy the scalar equation > %,-orf”X"=0, for 


each value of i and j. Hence the elements of the matrix }-%,-oR,.A” 
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must have the minimum polynomial of X as a common divisor. 

If the elements of >%,-oR»A" have a greatest common divisor 
which is not a unit, then the companion matrix of that polynomial 
will be a solution of the equations }%,-979” X"=0 and hence will bea 
solution of >-%,-.Rm- XX"=0. Similar considerations show that the 
same condition is necessary and sufficient for the solution of >~%,-9 X™ 
-XRn=0. A solution of xX"=0 is a solution of 
X Rm =0 and conversely. 


Coro.iary. The equations XX"=O and > XRn 
=0 will have a solution of order n if and only if the scalar equation 
formed by setting the g.c.d. of the elements of >~7,-oRn equal to zero has 
a solution of order n. 


The above condition will certainly be true if the elements of 
> n-0RmA™ have a common divisor whose degree is a divisor of n. 


THEOREM 5. If X; is a solution of the unilateral matrix equation 
> 7,<0oRmX"™=0, then all matrices similar to X, are solutions if and 
only if the minimal polynomial of X; is a common divisor of the ele- 


ments of > 


If the minimal polynomial g(A) of X1 is a common divisor of the 
elements of it follows that = Q(A) [g(A) J]. 

If is similar to X, then g(X.)=0 or 
That is, = P(A) (AT — X2) and X2 is a solution of the 
unilateral equation. 

To prove the converse, let X,=SX,S—'!. Since every matrix similar 
to X; is a solution, it follows that S is a solution of p Ba Ms fe 0. 
This is true for every nonsingular S. 

The equation }-7,-9R,SX1T=0 is equivalent to m? equations in 
the unknowns s;; (¢, j=1, 2,---,m). The matrix S may be con- 
sidered as a vector in mn? space. If there are m? nonsingular solutions 
S of >-j,<0RmSX1=0, and if these matrices, considered as vectors 
in ? space, are linearly independent, then it follows that }77,oRm 
(XT)"™=0. 

Let E;; be the Xm matrix which has 1 in the ith row and jth 
column and zeros elsewhere. The n?—m matrices J+ E;; (i¥j; i, 
j=i, 2,-+-+,m) are linearly independent and nonsingular. The 2 
matrices (¢=1, 2, ---+,m) are nonsingular and will 
be linearly independent if the a; (¢=1, 2, - - - , ~) are chosen so that 
the determinant of S,= (mM¥p; m, p=1, 
2,°°*+*, m) is not zero. This is obviously possible when »=2. A 
simple induction step suffices to show that it is possible for a general n. 
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A total of n? linearly independent nonsingular matrices have 
been constructed above. If every X2 similar to X; is a solution of 
then has n? linearly independent 
solutions for S. Therefore, X(X7)™=0, that is, X7 is a 
solution of a me xX"=0. Theorem 5 follows from Theorem 4 
since the minimal polynomial of X7 is the minimal polynomial of Xi. 


4. Infinite sets of solutions which are not families. There may be 
an infinite number of solutions of the unilateral equation without 
there being an infinite family. This fact is brought out in the proof of 
the following theorem. 


THEOREM 6. If | >-i,-oRnA"| =0, and if there exists a solution X, 
of the unilateral equation > ,<_oRmX"=O0, then there is an infinite 
number of solutions. 


The canonical triangular matrix P of DnoRaA* may be factored 
into the product QA where A is the canonical triangular form of 
and Q is triangular. Since | =5 Pmm=0 
(6 a unit), it follows that some element »;;=0, and therefore gi;=0. 

Factor P into Q’A’ letting af.=GQmm (m=1, 2,---,m), then 
Gam = Imm. Let gy be the first diagonal element of Q’ which is zero. 
Three possibilities arise in the factorization: 

1. (qj, @,)¥1, for some values p and j such that p>j, and 

2. Case 1 does not occur and k<n. 

3. Case 1 does not occur and k=n. 

In Case 1, as a result of Theorem 1, there exists an infinite family of 
solutions of the unilateral equation. The fact that some elements gj 
are zero may be taken care of by letting af ,4.._=1,1+m- 


If g=0, that is px. =0, (k<m), then pim=0 (m=k, R+1,---, 7). 
Since (m=1, 2,---+,m) it follows that all gf,=0. Choose 
Gin = Aim (l, m=n, n—1,---, k+1). The 


element aj, is obtained from the equation } ~*~: din@am=0. Since all 
Gim = 0, Af, may be chosen to be ax, with the coefficients of the powers 
of X replaced by parameters. Since (Ghim, Gn) =1 (m<k), the remain- 
ing terms of A’ may be expressed uniquely in terms of aj. As in 
Theorem 1, it follows that there is a family of right divisors of P 
with the same diagonal as A, and hence a family of solutions of the 
unilateral equation. In the final case k=m, and (Ghm, @pp) =1(m <p). 
Choose aj, =Gim (1, m=1, 2, -++,m—1). If a}, is chosen to be Gan, a 
unique factorization is obtained namely P=QA. Since gin=0, Gin 
may be chosen arbitrarily. Choose af, to be identical with a,,, except 
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in the constant term which will be a parameter. 

_It may easily be shown that if there is given a finite set of ir- 
reducible polynomials, and a given polynomial of fixed degree with 
an arbitrary constant, then for an infinite set of values of the arbitrary 
constant, the polynomial will be relatively prime to all of the poly- 
nomials in the given finite set. This follows from an application of 
Euclid’s algorithm and the fact that the field is of characteristic zero. 

If the finite set of irreducible polynomials is taken to be the factors 
of the elements (m=1, 2, - - - , then for an infinite set of 
values of the constant Gin) =1. Since (Gfm, nn) =1, Gan will be 
one of the aj. A unique factorization of P exists for each a{,. There- 
fore, there is an infinite set of right divisors of P. Since A is included 
in the set, there is an infinite set of right divisors of P which are left 
associates of monic matrices of degree 1. It follows that there is an 
infinite set of solutions of the unilateral equation. 

Note that in Case 3 the set of solutions, as defined, is not a family. 

It may be possible in certain cases to factor P in such a way as to 
get a family of right divisors which leads to a family of solutions. 
That this is not always true is shown by the following example in 
which 7 is the rational field R. Let 


of 


0 0 
There is no factorization of P in R[\] which will yield an infinite 
family of right divisors, and solutions of the equation: 


i @ 0 1 0 0 
0 0 0 0 0 0 
However, since 
A? 1 rX 
i 0 0 0 O 0 
there is the infinite set of solutions: 
0 t/e 
X= / 
0 
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MICHIGAN STATE COLLEGE 


CANONICAL POSITIVE DEFINITE MATRICES UNDER 
INTERNAL LINEAR TRANSFORMATIONS 


BERNARD VINOGRADE 


Let P be a real positive definite n-rowed square matrix, n=p+q 
+r, pSqsSr, S=diag (Si, Se, S3), where is p-rowed, is g-rowed, 
and 5S; is r-rowed, all nonsingular. S is called an internal linear trans- 
formation when it transforms P according to the rule PSPS’. All 
coefficients are real numbers. The problem is to find a canonical form 
for P, under internal linear transformations, depending on charac- 
teristic value systems associated with P. The partition of m into 
three parts is for convenience only. 


1. Semi-canonical reduction. Partition P according to the parti- 
tion of m, and call the square diagonal blocks P;, Pz, and P3, with 
g, and r rows respectively. Choose S:=Q;"', S:=Q;", S:=Q;" 
from the factorizations P2=(Q20/, P:=Q;Q3. Then 


Ip A B 
(1) SPS'=Q=|A' I, Cl, 


where J; is the i-rowed identity matrix. Any further transformations 
will be chosen so as to preserve the diagonal blocks J,, I,, I,. Hence 
we shall have S;S/ =J;, i=1, 2, 3. 


2. Non-special cases. Let X=(I,, A, B), Y=(A’, I,, C), Z 
=(B’, C’, I,), so XX’, YY’, and ZZ’ are all nonsingular and sym- 
metric. Let their characteristic roots be the ordered diagonal ele- 
ments of D,=diag (Ai, ---,Ap p), De=diag (wm, ---, pe), and 
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=diag (1,---,»,). If the X,’s are distinct, then an orthogonal 
matrix K, such that KiXX’K{j =D, is unique within a left factor 
diag (+1, ---, +1). If also the y,’s and the »,’s are distinct respec- 
tively, then S=diag (Ki, Kz, Ks) gives SQS’ in a canonical form 
with rows and corresponding columns orthogonal in groups of , q, 
and r, and coefficients unique within internal permutations and sign 
changes. 


3. Special cases. After carrying out the reduction of §2, let 
diag (Ki, Ke, Kz) stand for any further orthogonal internal trans- 
formation. Suppose there are a distinct \,;’s and a<. For conven- 
ience of discussion, assume there are a; \,’s in the first a; places in D,, 
Xi 1 and a,>1. Then in K, there is a free orthogonal a;-rowed block. 
Generally, Ki=diag (Ku, - - - , Kia), each Ki; orthogonal. It follows 
from (A, B)(A, B)’=diag (A.—1, - - - ,A,—1) thatif (A.,, are the 
first a; rows of (A, B) then (Aa,, Ba,)(Aa,, Ba,)’=(Ai—1) Ja,. Impose on 
Ky the condition (i, Aa,). If the are 
not distinct, suppose - - - =h.,~0. Let L; be the corresponding 
a Xa free orthogonal block in Ku, and (Aa,, Ba,) the corresponding 
a rows, then Aq, is of rank a; and contains an a Xa; nonsingular 
minor array G. After an internal permutation suppose A.,=(G, H), 
then the free ZL; may be absorbed by requiring L;G to be the unique 
triangular factor (say with positive diagonal coefficients) of G’G. If 
h,=0 then the process starts with B,,, and if the partition is into more 
than three parts, B,, may be orthogonalized first. If \:=1, then 
clearly (A.,, Ba,) =0. None of these transformations affect the ortho- 
gonality conditions imposed by the reduction of §2. Y and Z are re- 
duced similarly with the freedom in Kz and Kz, if any. 


4. The case »=p+4q. In this case a more detailed description is 
possible and is derived here as a consequence of the following lemma. 
It has also been displayed by Hotelling as a by-product of a statis- 
tical investigation.! 


Lemma. If two real matrices A and B are each a-rowed and b- 
columned, then AA’ = BB’ if and only if A=BU where U is orthogonal. 


Proor. AA’ is an a-rowed symmetric matrix, hence there exists an 


orthogonal V such that VAA’V’=VBB’V’=diag (d,---,d,, 
0, --+-,0), where and each d; is positive. Let D-! =diag (1/(d,)" 
Then D'VAA'V'D-'=D- VBB'V'D-—, 


=diag (J,, 0._,). Write D-'VA=X and D-'VB=Y, so XX’=YY’ 


1 H. Hotelling, Relations between two sets of variates, Biometrika vol. 28 (1936) p. 
321. The present paper is the basis for an extension to more than two sets. 
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= diag (J,, 0,_,). This implies that X has r nonzero orthonormal rows. 
Now to the r nonzero rows add b—r rows chosen so as to present a 
b-rowed orthogonal matrix Xo. Do the same with Y and call the result- 
ing orthogonal matrix Yo. Letting U= Yi Xo, we have Xo= YoU. 
Hence X= YU and A=BU. 


Let 
() 
be the semi-canonical form (1) of a positive definite matrix parti- 
tioned according to n=p+q. Let K be an orthogonal matrix such 
that KXX’K’=D, where D is diagonal. Hence KAA’K’=D—TI, 
= BB’, where B is a p by gq matrix whose first p diagonal coefficients 
are the square roots of those of D—J,. By the lemma there exists an 
orthogonal matrix L’ such that B= KAL’. Then the canonical form 
desired is obtained by using the internal transformation 


K 0 
1): 

5. Applications. These canonical forms have been applied to the 
determination of minimal principal minors of partitioned correlation 
matrices in an unpublished paper by R. G. D. Steel and the author. 
These minors have statistical significance as generalizations of the 
minors 1—}j formed from the matrix of §4, with b; a diagonal coeffi- 
cient of B (see footnote 1). 

One easily proved property of the semi-canonical reduction of §1 
is the following: Among all SPS’ with unit diagonal coefficients and 
fixed? P, the determinant |SPS’| is maximum when SPS’ is in the 
semi-canonical form. For, let P; be the first diagonal block of P and 
let SPS’ be in semi-canonical form. Then S,P:S{ =I,, or | Si|* 


=1/|P,| 21. Similarly | S.|?21, | S;|?21. Hence | S|?21, implying 
| SPS’| =P. 
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? Here P is a correlation matrix. 


NOTE ON THE ESCALATOR METHOD 
BERNARD VINOGRADE 


Practical solution of the characteristic vector problem 
(1) AX = XX, 


where A = (a,;) is a given real nonsingular n-rowed matrix and X is a 
column (x1, - - - , Xn)’, has recently been treated anew by J. Morris.' 
His process is called the escalator method. If A, is a p-rowed prin- 
cipal minor array in the upper left corner of A, then the abbreviated 
system 


is called the pth stage of (1). As given by Morris, the escalator 
method is based on expressions for the solutions of the (p+1)th 
stage in terms of those of the pth stage, and deperds on the existence 
of biorthonormal sets of solution vectors for (2) and the transposed 
system 


(3) = XY >. 


But, of course, it is not always possible to find such biorthonormal 
sets of solution vectors, since the elementary divisors of the pth stage 
may not be linear. In this note a more general exposition of escalation 
is given, covering all cases and leading to at least one practical method 
in cases outside the scope of Morris’ formulas. In addition, a result 
on the transmission of roots from one stage to the next is included. 

Let A; be a characteristic root of multiplicity ”; of the system (2), 
and X,; a corresponding characteristic vector. As is well known, there 
exists a nonsingular Pp» such that 


Ai 6, 0) 
Ai be 


(4) A,Po = PoAo, where Ap = 


Received by the editors December 27, 1948. 
1 J. Morris, The escalator method, Wiley, 1947. 
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the 6,’s being zero or one. If we set 


C = » D = 
(5) A ) P B= 
pti = C = 0 42 » 


then one can easily expand the equation 


(6) | Api — = = 0, 


CP» Gp+1,pt+1 — 


where I, is the p-rowed identity matrix. This equation is the general 
escalator equation for the determination of the characteristic roots 
at the (p+1)th stage. When Apo is diagonal, then (6) coincides with 
Morris’ equation? and is especially effective in symmetric systems. 
Since A; P>’=P,; 'Ad, the transposed system (3) leads to the same 
escalator equation | B’—dIp4:| =0. 

For a determination of Po, one may place X,, in the first column, 
X 2 in the (m2+1)th column, and so on. The remaining coefficients 
and the 6,’s may then be solved from the system (4).’ 

For a determination of the characteristic vectors of the (p+1)th 
stage,‘ let X* be a +1-columned matrix of the unknown vectors 
and let A be the diagonal matrix of corresponding characteristic 
roots of the (p+1)th stage as determined from the escalator equa- 
tion (6). Then from A,.:X*=X*A it follows that P-'A,., PP-'X* 
= P-1X*A, or BZ*=Z*A, where Z*=P-'X* and B=P-'Ayi:P (as 
defined by (5)). Because of the simple structure of B, the equations 
BZ*=Z*<A afford an easy determination of Z*, hence of X*= PZ*. 

As for the transposed system A},, ¥*= Y*A, we note that if Z* is 
nonsingular (which is possible whenever the elementary divisors at 
the (p+1)th stage are linear), it follows from BZ*=Z*A that B’Z*-” 
=Z*-"A, hence Y* = P-"Z*-" = X*" is a solution of the transposed 
system. 

; Pe), Po’ D=(Qa, -++,@Q,)’, then a necessary 
and sufficient condition for a root A, of the pth stage to be a root of 
the (p+1)th stage is P,Q,=0 for any pair P,, Q, arising from a 
column of Py) which is a characteristic vector of the pth stage cor- 


2 The components of CP, and P,'D are Morris’ P,’s and P;'s. 

3 In connection with (4), the escalator method for simultaneous linear systems as 
compactly presented by R. A. Frazer, Note on the escalator method, Philosophical Maga- 
zine vol. 38 (1947) pp. 287-289, is pertinent. 

4 Checks and other special procedures have been developed by R. J. Lambert, 
Master’s Thesis, Iowa State College of Agriculture and Mechanic Arts, 1948. 
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responding to 

For the necessity, let x,4: be the last component of X,,; and notice 
that the systems A pi1Xpi1=AXpy1 and Aji, may be 
written 


(7) (A, Alp) Xp + Dxpi1 = 0, 
CX + — A) = 0, 
(Ay = 0, 


S 


D'Y 5 + — A) 


Hence if P,=CX,,=0, then (7) is satisfied by A=, and X41 
=(X;,, 0)’. Similarly, Q,=0 implies (8) is satisfied by \=X, and 
( Ye 0)’. 

On the other hand, if we left-multiply the p equations (Ay, D)X p41 
by Yj, and recall that AZ then 


(9) (Ae — + = 0. 


Now if (9) is satisfied by \=X, and a corresponding vector Xp41,,, 
then either Q,=0 or xp41=0. If x41=0, then the first components 
of Xp+1,, satisfy the pth stage, hence P,=0. Thus the sufficiency is 
proved. 
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THE MAXIMAL REGULAR IDEAL OF A RING 
BAILEY BROWN AND NEAL H. McCOY 


1. Introduction. An element a of a ring R is said to be regular 
if and only if there exists an element x of R such that axa=a. The 
ring R is regular if and only if each element of R is regular. The con- 
cept of a regular ring was introduced by von Neumann [5, 6]! who, 
however, required also that a regular ring have a unit element. 

Unless otherwise stated, the word ideal shall mean two-sided ideal, 
and an ideal in R will be said to be regular if and only if it consists 
entirely of regular elements of R. It is easy to see that a regular ideal 
A in R is itself a regular ring. For if a€A, there exists an element x of 
R such that axa=a. It follows that axaxa=a and xax€A, so a is 
regular in the ring A. 

We shall show that the join of all regular ideals in an arbitrary 
ring R is a regular ideal, and hence that there exists a unique maxi- 
mal regular ideal = M(R) in R. The purpose of this note is to estab- 
lish a few fundamental properties of M(R). Among these are the fol- 
lowing “radical-like” properties: (i) M(R/M(R))=0, (ii) if B is an 
ideal in R, then M(B) =B\M(R), (iii) if R, is the complete matrix 
ring of order over R, then M(R,) =(M(R)),. A special case of this 
last result is that R, is regular if and only if R is regular. This was 
proved by von Neumann [6], but we shall include a very simple 
proof of this fact. 

It is well known that every regular ring has zero (Jacobson) radical 
J. For the rest of the introduction it is assumed that R is a ring such 
that R/J is regular. We note that this condition is satisfied if, for 
example, the right ideals of R satisfy the descending chain condition. 
Thus M=R if and only if J=0, and hence, in some sense, M may be 
considered as an “anti-radical.” It is shown in §4 that M=0 if and 
only if R is bound to its radical J in the sense of Marshall Hall [2]. 
Moreover, in §5 it is proved that, under the descending chain condi- 
tion for right ideals, R is expressible as a direct sum 


R=M+M*, 


where M* is the ideal consisting of all elements a of R such that aM 
= Ma=0. It follows that M is semi-simple and M* is bound to its 
radical, and thus this direct sum decomposition coincides with one 


Presented to the Society, December 30, 1948; received by the editors January 19, 
1949. 


1 Numbers in brackets refer to the bibliography at the end of the paper. 
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obtained by Marshall Hall [2]. 


2. Existence and simple properties of M(R). Let R be an arbitrary 
ring, and a an element of R. The following lemma plays a central 
role in several of our proofs: 


Lemma 1. If y is an element of R such that a—aya is regular, then a ts 
regular. 


ProoF. If a—aya is regular, there exists an element z of R such that 
(a — aya)z(a — aya) = a — aya. 


If we set x =z—zay— yaz+yazay+y, a simple calculation shows that 
axa =a, and thus a is regular, which completes the proof. 

We shall indicate by (a) the principal ideal in R generated by a. 
We now prove the following theorem. 


THEOREM 1. Jf M is the set of all elements a of R such that (a) is 
regular, then M is an ideal in R. 


Proor. If M and then M since (zt) C(z). Similarly, 
M. If z, and a€(z—w), then a=u—v for some wu in (2) 
and v in (w). Since (z) is regular, u = uru for some element r of R. Then 


a@—ara=u—v— (u— v)r(u — v) = — urv+ oru — 


Since v€(w), this shows that a—ara€(w) and is therefore regular. 
Lemma 1 now implies that a is regular, and hence z—wC WM. This 
completes the proof of the theorem. 

It is clear that M, being the join of all regular ideals in R, and being 
itself regular, is the unique maximal regular ideal in R. It may be 
remarked that the proof of the above theorem is analogous to the 
proof of Theorem 1 in Brown and McCoy [1]. 

We shall next prove the following theorem. 


THEOREM 2. If R is any ring, M(R/M(R)) =0. 


Proor. Let @ denote the residue class modulo M(R) which con- 
tains the element a of R. If M(R/M(R)) and aE (bd), then (5). 
Since (5) is a regular ideal in R/M(R), @ is regular. If d=4aza, a 
—axa€ M(R), therefore a—axa is regular and Lemma 1 implies that 
a is regular. This shows that every element of (0) is regular, and 
hence b€ M(R). Thus }=0, completing the proof. 

Suppose now that B is an ideal in R, and let } be an element of B 
which generates a regular ideal (b)’ in the ring B. Let (0) be the ideal 
in R generated by the element 3, and let 


7 
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c=nb+rb+ bs+ ribs; (m an integer; 1, s, s; in R) 
be any element of (5). Since 5 is regular in B we have b=5b,b for 
some in B. Hence 


c = nb + (rbbi)b + 5(b;bs) + >» (7:bb1)b(bibs;), 


and thus c€(6)’; therefore (6) is regular since it coincides with (0)’. 
This shows that if M(B), then D©B/\M(R). Conversely, if 
bE B\M(R), then 6} is an element of B which is regular in R, and 
it is easy to see that b is therefore regular in the ring B. Since 
BC\M(R) is a regular ideal in the ring B, it follows that B(\M(R) 
C M(B). We have therefore proved the following theorem. 


THEOREM 3. If B is an ideal in R, then M(B) =B/\M(R). 


3. The maximal regular ideal of a complete matrix ring. In this 
section we shall prove the following theorem. 


THEOREM 4. If R,, is the complete matrix ring of order n over R, then 
M(R,) =(M(R)),. 


First we give an elementary proof of the special case of this result 
in which R itself is regular, and therefore R= M(R). This result, 
under the assumption that R has a unit element, is due to von 
Neumann [6]. 


Lema 2. If R is a regular ring, then R, ts a regular ring. 


The proof of this is in two steps, the first being the proof for »=2, 
and the second the extension to arbitrary n. If rCR, let us denote by 
r’ an element of R such that rr’r=r. Now let 


a b 

c d 
be an arbitrary element of R2. If we set 
0 0 

x=(5 

and denote A—AXA by B, a simple calculation shows that 

0 

¢ 


for suitable choice of elements g, h, 4 of R. If 


| 
SC 
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g O 
Y= 


0 0 
C= BYB=( ), 
k 0 


then 


for some element & of R. Finally, if 
Z= 
0 0 


C — CZC = 0. 


we see that 


This means that C is regular and hence, by Lemma 1, B is regular. 
Again applying Lemma 1, we see that A is regular, and this com- 
pletes the proof for n=2. 
Since (R2)2R,, it follows from the case just proved that R, is 

regular, and similarly Ry is regular for any positive integer k. If 

now # is an arbitrary positive integer, choose k so that 2*2n. If 
AER,, let A: be the matrix of R# with A in the upper left-hand 
corner and zeros elsewhere. Now, as an element of R+, A; is regular, 

that is, there exists an element 


(BE R,) 
D E/’ \ 


of R# such that A:XA,=A,. However, this implies that ABA=4A, 
and hence A is regular. The proof of the lemma is therefore complete. 

By the lemma just proved, (M(R)), is a regular ideal in R,, and 
hence (M(R)),C M(R,). Conversely, let A be a matrix in M(R,), 
and let a;; be a fixed element of A. Since (A) is a regular ideal, there 
exists an element X of R, such that A=AXA=AXAXA, and there- 
fore 


P.@ 
for suitable elements ¢,,, 5p, of R. But it is easy to see* that there 
exists a matrix of (A) with ¢,,@,95pq in (1, 1) position and zeros else- 
where, and hence an element of (A) with a,; in (1,1) position and 
zeros elsewhere. Now if 5 is any element of the principal ideal in R 


* See, for - .nple, Lemma 5 of [1]. 
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generated by a;;, it is clear that there exists an element B of (A) with 
b in the (1,1) position and zeros elsewhere. Furthermore, we have 
BYB=B for suitable choice of Y in R, since (A) is regular. But this 
implies that byub=b, and hence b is regular. This shows that aj; 
€M(R), and hence that M(R,)C(M(R))., completing the proof of 
the theorem. 


4. Some additional properties of M(R). By the annihilaior B* of 
an ideal B in a ring R is meant the ideal consisting of all elements a 
of R such that cB=Ba=0. 


THEOREM 5. If M is the maximal regular ideal of a ring R and J is 
the Jacobson radical of R, then M(\J=0, JOM*, MCJ*, and 
M(\M*=0. Furthermore, J is the radical of the ring M* and M is the 
maximal regular ideal of the ring J*. 


Proor. Since J contains no nonzero idempotent element [3, p. 305], 
M(\J=0. From this it follows that MJ=JM=0, so JC M* and 
MCJ*. If ce M\M*, then a=axa for some x. But aE M and xa 
€ M*, hence a(xa) =0 and M(\M*=0. The last sentence of the theo- 
rem follows from the observation of Perlis [4] that if B is any ideal in 
R, the radical of the ring B is just B(\J, and from the analogous 
Theorem 3. 

Following Marshall Hall [2], we may say that a ring R is bound 
to its radical J if and only if J*CJ. 

The next theorem gives, for a class of rings including all those whose 
right ideals satisfy the descending chain condition, a necessary and 
sufficient condition that the maximal regular ideal be the zero ideal. 


THEOREM 6. If R is a ring such that R/J is regular, then M=0 if 
and only if R is bound to J. 


Proor. If R is bound to J, it follows that M=0 even without the 
condition that R/J be regular. For M(/\J=0, and this implies, as in 
Theorem 5, that MCJ*CJ. Hence M=0. 

Conversely, let R/J be regular and M=0. We show first by induc- 
tion that J\J*?=0. Suppose that and that j= aid; where 
a;, b; are in J*. It must be proved that 7=0. In the regular ring 
R/J, &; is regular, so R contains x; such that a;—a,x,a;=j;E€ J. Since 
b;€J*, we have 


(1) G = + = 


If m=1, this implies that =0 since a,C J*. If n¥1, 


\ 
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then 


n—l 


Gnbn = j — >, ashy. 


t=1 


Thus by (1) 


n—1 n—1 n—1 
j = + AnXn > a,b; = (a;x; 
t=—1 t=1 t=1 
But the induction hypothesis asserts that if j= > 72) cid; and ci, d; 
are in J*, then j7=0. Since (aix;—a,x,)a; and }; are in J*, it follows 
that 7=0, and we have proved that J(/\J*?=0. This implies, however, 
that J*? is a regular ideal. For if a€ J**, then in the regular ring R/J, 
the element 4 is regular, that is, for some x, a—axa€ J(\J**=0, soa 
is regular. Hence J*?C M=0, from which it follows that J*CJ since 
the radical contains all nil ideals [3, p. 304]. Thus R is bound to J and 
the proof is complete. 


5. A decomposition theorem. In this section we point out the role 
played by the maximal regular ideal M in a theorem of Hall [2], and 
incidentally give a new proof of his result. 


Lemma 3. If an ideal B in a ring R has a unit element e, then 
R= B+ B*. 


Poor. The existence of a unit element in B implies that B(\B*=0. 
If then ex+xe€B and hence (ex+-xe)e=e(ex+<xe), from which 
it follows that xe=ex and e¢ is in the center of R. Thus the Peirce de- 
composition 
x = ex-+ (x — ex) 


expresses each element x of R as a sum of elements ex of B and x—ex 
of B*, and the desired result is established. 

We remark that a right ideal J in the ring M is a right ideal in R. 
For if J, rE R, then M, hence for some element y of R, aryar 
=ar. But ryar€G M, so arCl. Thus J is a right ideal in R. 

From this remark, it follows that if the descending chain condition 
for right ideals holds in R, it holds also in M. In the presence of this 
chain condition, regularity is equivalent to semi-simplicity. Hence M@ 
has a unit element, and the first sentence of the following theorem is 
implied directly by Lemma 3. 


THEOREM 7. If a ring R satisfies the descending chain condition for 
right ideals, then 
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= M + M*. 
The ring M is semi-simple and the ring M* is bound to its radical. 


The semi-simplicity of M is implied by the regularity of M. 
Since the maximal regular ideal of M* is zero by Theorem 3, and the 
chain condition holds in M*, it follows from Theorem 6 that M* is 
bound to its radical. 

Hall has shown that a ring R satisfying the descending chain con- 
dition for right ideals can be represented in a unique way as the direct 
sum of a semi-simple ring and a ring which is bound to its radical. 
The result just established shows that the semi-simple component is 
precisely the maximal regular ideal M of R, and the bound com- 
ponent is the annihilator of M. 
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PROOF OF RAMANUJAN’S PARTITION CONGRUENCE 
FOR THE MODULUS 11’ 


JOSEPH LEHNER! 
1. Introduction. Ramanujan’s conjecture, that 
(1.1) p) =O (mod 11%), 


where p(m) is the number of partitions of m and p is the least positive 
integer? such that 


(1.2) 24p = 1 (mod 11°), 


has never been proved for general values of a. Proofs for a=1, 2 
were given by Ramanujan and the author.’ The object of this note 
is to supply a proof for a=3, that is, to establish the congruence 


(1.3) (13311 + 721) = 0 (mod 11°), 
The proof in I for ~=1 used a method developed by Rademacher 


(Trans. Amer. Math. Soc. vol. 51 (1942) pp. 609-636). It was effected 
by constructing an identity which, by analogy, would be of the form 


(1.4) > + = 


for general a. Here >> should be a sum, with integer coefficients, of 
certain functions having expansions in powers of x with integral 
coefficients. Now the power series on the left-hand side of (1.4) can 
be expressed rather simply in terms of the Dedekind function 


n(r) = exp (rir/12) II (1 — x”) 
(1.5) 


= exp (wir/12) 41 +> 


where, throughout this paper, x = exp 27ir and Im 7>0. In fact, it is 


Presented to the Society, October 30, 1948; received by the editors September 13, 
1948 and, in revised form, February 9, 1949. 

1 The author is greatly indebted to the referee, who made a very careful review of 
the paper, correcting some errors and clarifying a number of ambiguities. He also 
supplied footnote 2. 

2 Actually p can be given explicitly. It is (23-11%+1)/24 for a even and 
(13 -11%+1)/24 for odd. 

3 Cf. footnotes 2 and 5 in J. Lehner, Ramanujan identities involving the partition 
function for the moduli 11*, Amer. J. Math. (1943) pp. 492-520. This paper will be 
referred to hereafter as I. 
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easy to deduce (I, (7.32)) from the above definition that 


l=0 


= 11-* + 24d)/119) = 119), 


11% 


(1.6) 


say, where « is 1 for even a and 11 for odd a. 

The important fact about L(r; 11*) is that (I, Theorem 8) it isa 
modular function on the subgroup‘ I'9(11) of the full modular group, 
and has its only singularity at the parabolic vertex r=0 of the sub- 
group’s fundamental region. As such it must be a polynomial of the 
form 


(1.7) Qa(A,C) = +--+) +---) 


in the two basic functions A(r) and C(r), whose definitions we shall 
recall in §3. Thus we have 


(1.8) (1 — + = L(r; 11*) = Q.(A, C). 
m=1 

Now the power series of A(r) and C(r) in terms of x have integral 
coefficients, and so Ramanujan’s conjecture for 11* follows if we can 
show that the coefficients of the polynomial Q. are divisible by 11+. 
This was done in I for a=1 by explicit calculation. 

To go on to higher values of a we proved that the functions 
L(r; 11%) can be given inductively by using the linear operator 


(1.9) UyF(r) = UF(r) = Te)» F((r + A)/11). 


A=0 


In fact (I, Theorem 7), if we put L(r; 11°) =1, we have 


(1.10) L(r; = U{ &(r)-L(r; 112%)}, 
(1.11) L(r; 112™+2) = UL(r; 112™*1), 

for m=0, 1, 2, - - - , where 

(1.12) ®(r) = B(r; 121) = 


The nature of the inductive step depends on the parity of a. 

Now UCi, U®Ci-', UACi, U®ACi, j=1, 2, are modular 
functions on I'o(11) whose only singularities are at the parabolic 
vertex zero (I, Theorem 8). Thus, because U is linear, we can go from 


4 Defined by the condition 11 le in the modular substitution r’ = (ar+5)/(cr+d). 
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one of the identities (1.8) to the next provided we know the expres- 
sions for UC’, - - - , as polynomials in A and C. However, if we are 
interested only in proving Ramanujan’s conjecture for a given a, we 
can abbreviate this explicit procedure considerably, as was done in I 
for a=2 and will be done below for a=3. The basic tool will still be 
the inductive property (1.10), (1.11). 


2. Method. We shall find it convenient to replace the rational basis 
A(r), C(r) by the basis C(r), D(r), where 


(2.1) 11D(r) = A(r)C(r) — 1, 


and D(r) has integral coefficients (I, Lemma 4). A modular function 
on I'o(11) whose only singularity is at the parabolic vertex t=0 (for 
example, L(r; 11*)) can be expressed as a polynomial in C and D 
which is of first degree in D. 

Now let us use the notation F(r)=G(r) (mod 11%) to mean that 
corresponding coefficients of the expansions of F(7r) and G(r) about 
4 in powers of x are congruent mod 11*. Then proving Ramanujan’s 
conjecture for a=3 is equivalent to proving that 


(2.2) L(r; 11%) = 0 (mod 11%). 
The operator U affects a power series as follows (I, (4.74)): 
(2.3) u( ans") = >) aumx”. 
n2v m2v/11 


Thus we can say that if the coefficients of the expansion of F(r) are 
integral, then so are the coefficients in the expansion of U F(r). Hence 
from F(r)=G(r) (mod 11%), we can infer that UF(r)=UG(r) 
(mod 112). 

The identity for L(r; 11) is explicitly (I, (6.6)) 


L(r; 11) = 11AC(11C + 2) — 11(11°C? + 32C + 2), 


or in terms of C, D 


(2.4) L(r; 11) = 112D(11C + 2) — 11(11°C? + 210). 
Hence 
(2.5) L(r; 11) = 2-112D — 21-11C (mod 11%). 


Now we apply the second inductive step (1.11) and make use of the 
remark in the preceding paragraph and the linearity of U to obtain 


(2.6) L(r; 117) = UL(r; 11) = 2-11°UD — 21-11UC (mod 11°). 


In the next section we shall prove 


| 
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(A) UC = 11a, C (mod 117), a, = integer, 
(B) UD = 0 (mod 11). 


From these congruences, the truth of which we assume for the 
moment, we conclude from (2.6) that 


(2.7) L(r; 117) = 112a, C (mod 11%). 


Now ®(r) has an expansion in powers of x with integral coeffi- 
cients, as we easily verify from (1.12) and (1.5). Hence, if we apply 
the first inductive step (1.10) to (2.7), we find 


(2.8) L(r; 11%) = 112a, (mod 11°). 


Therefore, the desired congruence (1.3) will be completely proved if 
we can establish (A), (B), and 


(C) U®C = 0 (mod 11). 


3. Proof of (A), (B), and (C). In the proof of these equations, we 
shali make use of the fact, demonstrated in I, §8, that their left 
members, namely, UC, UD, U®C, are entire’ modular functions on 
the subgroup I'9(11) of the modular group, and shall express them as 
polynomials in the basic functions C, D of this subgroup. We there- 
fore recall the definition of C and D. As before, 7 is a complex variable 
with positive imaginary part, and x=exp 27ir. All the functions 
listed below have Laurent expansions in x with integral coefficients. 
For further details, cf. I, §4. 

Starting with the formula for (7) in (1.5), we define in turn: 


(3.1) = {n(r)n(117)}%, 
(3.2) g(r) — 1/1200=2 on) = Dat, 
1 d\n 
(3.3) G(r) = 11°g(117) — g(r), 
(3.4) = > = m? + mn + 3n?, 
(3.5) A(r) = 6°(r)/p(r), 
(3.6) B(r) = G(r)/¥*(r), 
(3.7) C(r) = 2-1-11-*{ A%(r) — 10A(r) — 22 — B(r)}. 


5 An entire modular function is one which has no singularities except possibly at 
the parabolic vertices of the fundamental region, which, in this particular case, are 
located at 0 andi. 


| 

\ 

| 

| 


176 JOSEPH LEHNER [April 


The expansions of the following functions of interest at the para- 
bolic vertices (0, 1) of I'9(11) appear in I (references at right of 
display lines are to equations in I). In order to avoid confusion we 
write (UF|r’) to denote the result of replacing rt by 7’ in the right 
member of (1.9). By P(x*) we shali mean a series in powers of x, 
with integral coefficients, beginning with a term in x’. 


(3.8) A(r) = x17+6+4+--- (6.41), 
(3.9) C(r) = x+5x7+--- (6.43), 
(3.10) A(—1/11r) = A(r) = +6+4+--- (4.53), 
(3.11) 11°C(—1/11r) = + 2214+--- (6.46), 
(3.12) UC(r) = P(x), (8.2), 
(3.13) 11°(UC| —1/11r) = 11°C(—1/121r) + 11°P(x) 

(8.81), (9.31). 


The first terms of the expansions of D can be deduced from (2.1), 
(3.8)—(3.11): 


(3.14) D(r) = 
(3.15) 11°D(—1/lir) = +---. 


By Fermat’s Theorem and an elementary theorem on binomial 
coefficients, we observe that 


(3.16) = {11°9C(—1/11r)}" +4 11P(a-*); 
consequently, in view of (3.13), 
(3.17) 11%UC| —1/11r) = {11°C(—1/11r) }™ + 11P(2-*). 


The argument in the first paragraph on p. 505 of I shows that an 
entire modular function on I'p(11) which vanishes at infinity and has 
at most a pole of first order at the vertex r = 0 must be identically zero. 
We apply this principle to the function 


A(r) = 11°UC(r) — {11°C(r)}* — {11%C(r) + 11480D(r) 


10 9 
+ 112%4eCi(r) + 11°D(r)- 
j=? j=1 

where the a, 8 are rational integers chosen so that all terms of the 
principal part of A at r’= —1/11r=0 drop out except possibly the 
term in x~'. This choice is clearly possible in view of (3.17), (3.11), 
and (3.15). Furthermore, all terms of A vanish at r=i~, as we see 
from (3.9), (3.12), (3.14), and therefore A itself. It follows that A(r) 
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is identically zero, and we infer the truth of the congruence 
(3.18) UC(r) = + 118,D(r) (mod 117). 


Now writing 


(3.19) 
1 1 2 
and remembering (2.3), we obtain the following linear congruences for 


a and Bo on matching the coefficients of x and x? in the two members 
of (3.18): 


C10, + = cy, 

Coat, + 11d2Bo = Coz 

or using the values in the tables of §4, 
ay + 118) = 44 
Sai + 668o = 99 


(mod 11%), 


(3.20) (mod 11%). 


Therefore, we have 
a, = 6(44) — 99 (mod 117), 
a, = 0 (mod 11); 

118) = 99 — 5(44) = 11(9 — 20) (mod 112), 
Bo = 0 (mod 11). 


(3.21) 
(3.22) 


With (3.18), (3.21), and (3.22), we have completed the proof of (A). 
The proof of (B) is quite similar. From I, (8.81), and (3.14), (3.15), 


114UD| —1/11r) = 118D(—1/1217) + 119P(x) 
= 733 


(3.23) 


and, as before, 

(3.24) 11“UD| —1/11r) = {11°D(—1/117)}" + 

Now employing the same procedure used for UC, we obtain easily 
114UD(r) = a11°C(r) + Bol1*D(r) (mod 115), 


with integral 8o. Equating coefficients of x, we have ai: +118,=0 
(mod 11), that is, a,=1la. Hence, 


(3.25) UD = oC + BD (mod 11), a, Bo integral. 


We now again equate coefficients and get the congruences 
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=d 
a+ Bo (mod 11) 
Sa + 68o = dz 


for the determination of a and fy. The determinant of the coefficients 
is seen to be equal to unity. Hence 


(3.26) 


dy dy 
(3.27) 2= = 0 (mod 11), 
dz 
since dy, =d2,=0 (mod 11). Likewise, 
dy 
(3.28) Bo = = 0 (mod 11). 
C2 dee 


This proves (B). 

For the proof of (C), we need the expansions 
(3.29) = 
(3.30) 11@(—1/1217r) = = 


The first is computed immediately from (1.12) and (1.5), whereas 
(3.30) is a consequence of the well known transformation equation 


(3.31) n(—1/r) = 


where the square root is positive when 7 is on the positive imaginary 
axis. Then by virtue of (2.3) we may write 


(3.32) USC = P(x). 


We must now find the expansion of U®C at r’=0, that is, 
(USC| —1/117r). We cannot use I, (8.81) this time, for ®C is not a 
function of I'9(11). Instead, we have recourse to I, p. 509, in which 
the function G(r; 7, 7, k) = U®‘AiC* is considered. (The last two dis- 
play lines on p. 509 are in error: the line at the bottom of the page 
should have the factor « multiplying the right member; the display 
line above this requires the factor® 1/@(7/11) affecting the sum in 
the right member.) The case we are interested in is 7=0, i=k=1. If 
we substitute the value of the sum given at the bottom of the page 


in the expression for 11G(—1/r) appearing immediately above, and 
replace by we obtain 


11(U@C | —1/11r) = {116(—1/121r)} {11°C(—1/1217)} 


— >> > exp { + 


m=5 p=1 11 


° This factor is aiso required multiplying the sum in the right member of (8.31). 


\ 
| 
| 
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with the Legendre-Jacobi symbol 


> 
11 


From the last equation we deduce, in view of (3.29), (3.30), and 
(3.11), 


114(U@C | —1/1ir) = {116(—1/121r)} {11°C(—1/1217)} 
+ 113P(x) = +-- 


(3.33) 


We therefore obtain in the usual way’ a congruence 
(3.34) 114U@C = + l*C? + 1178 (mod 115) 


with rational integers a, 8. We want to show that 11%|a1, 11] fp. 

This, however, is a consequence of the following congruences obtained 

by matching the coefficients of x and x? in the two members of (3.34): 
a, + 118) = 0 


3.35 od 117). 
( ) 5a, + 6685 = 0 


Therefore, a= 11%, 8o=118, and we have from (3.34) 


(3.36) USC = aC + aC? + BD (mod 11), a, a, 8 integral. 
(3.36), in turn, yields the congruences 
a + Been 
(3.37) Sa+ az+ 68 = ess (mod 11), 


19a + 10a2 288 = €33 
where we have written 
(3.38) @(r)C(r) = en x. 
n=6 


The determinant of the coefficients is —1. Since e)=e2=e3=0 
(mod 11), we have 


0 1 
1 £46 =0 (mod 11), 
€33 10 28 


7 The analogue of (3.16) and (3.24) is not valid for the function $C, since &(7) 
is not a function of I'9(11). The equation we would like to write, 11@(—1/121r) 
-112C(—1/121r) = {114(—1/1ir) is obviously false, 
for (3.30) shows that @(—1/11r) does not have an expansion in integral powers of x. 
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and likewise a,.=820 (mod 11). This establishes (C), and with it, 
the truth of the congruence (1.3). 


4. Tables. The preceeding discussion has utilized the values of 
certain coefficients, which will be developed in this section. In the 
computation of these, the following formulae, in addition to those 
given previously, will be found useful: 


(4.1) Ia > (—1) /2 (Euler), 
(4.2) i+ p(n) x" (Euler), 
(4.3) = (I, (4.61) and footnote 16) 
with® 


11h 2, 


Ag(n) = 4 “ln 
Ag(n/11), n, 


where d,;=quad. res., d.=quad. non-res. (mod 11). 

The coefficients of the functions C, D, and ®C are computed in a 
straightforward way from the definitions (1.5), (1.12), (3.1)-(3.7), 
(2.1), with the help of (4.1)-(4.3). The values are shown in the fol- 
lowing tables to the highest modulus required for the proofs of the 
preceding section. 


RESIDUE OF THE COEFFICIENTS OF CERTAIN POWER SERIES 


Function 
of ala |v | G B C|D\@ 
modulo 0 
~2 1 1 | 1 4 
—1 1 1 12 4 2 
0 1 6 70 14 — 12 
1 5 4 17 296 40 —2 — 116} 1 | 1 | 
2 10| 4 46 | 1,073} 105 -18| — 597| 5| 6} 
3 20; 8 116 | 3,460 252 —56 | —2,298 | 19 | 28 


* The formula for Ag(”) was printed incorrectly in I. 
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t, Function 
Power ae | i | | | | | 
A 2 
ie 
“4 | modulo 2-114 = mod 11 
| 
4 36 | 252 110,150 574 | — 146 |— 7,616 | 63 
5 65 | 16| 533 |27,704| 1,240 | — 252 |—22,396 | 64 1 
6 | 10} 32 | 1,034 = 2,580 | — 504| 27,114 | 18 i 
7 185 | 16 | 1,961 |26,858 | 5,180 | — 688 | 22,010 | 60 2 
), 300 | 36 | 3,540 |23,282 /10,108 |— 1,170 | 12,082 | 21 3 
9 481 | 28 | 6,253 |27,176 |19,212 |— 1,514 | 6,028 | 71 5 
10 754 | 40 110,654 | 207 | 6,427 |— 2,268 | 18,539 | 89 7 
11 | 1,169 | 4 (23,232 | 6,452 |— 2,422 | 9,306 | 44 0 
| 12 | 1,780 | 64 | 2 |25,108 |28,404 |— 4,088 | 4,276 | 86 4 
13 | 2,685 | 40 |17,983 | 3,456 |28,712 |— 4,396 770 |115 0 
| 14 3,996 | 64 |16,304 |20,971 | 2,641 |— 6,192 | 24,911 | 36 8 
15 | 5,894 | 56 30 |13,030 |19,004 |— 7,056 | 16,602 |105 9 
16 | 8,600 | 68 | 4,916 | 1,420 |23,952 |— 9,362 | 18,326 | 90 i 
17 [12,450 | 40 |12,024 |20,348 | 364 |— 9,828 | 12,154 | 21 
18 |17,860 | 100 | 5,352 | 609 |17,447 |-13,626 | 23,561 | 47 2 
a | 19 25,442 | 48 | 5,288 |15,722 |26,482 |—13,720 | 7,612 | 57 3 
' 20 | 6,682 | 104 | 9,118 |29,184 |23,816 | 10,886 | 3,102 | 94 0 
21 21,237 | 80 [26,561 28,386 | 1,102 | 10,018 | 3,324 | 95 0 
22 |11,926| 4 |21,362 | 5,929 17,083 | 7,484 | 2,607 | 99 0 
23 | 9,954 56 | 8,592 5,160 |22,928 | 4,946 | 22,574 | 57 0 
modulo 2-113 modulo 11 
24 |1,792|144| 2,376) 220| 1,846| 1,190] 3 6 
25 | 1,421 | 68 | 1,435 | 1,718 | 2,174 442 | 1,888] 6 0 
26 | 2,436 | 88| 310} 893 | 1,447 | 366 | 2,391 | 3 0 
; 27 | 2,544 | 104 | 1,118 | 2,562 138} 1,712 94/ 8 1 
28 | 1,012 | 858 | 384 | 1,040 354 758 | 7 1 
29 | 2,125 | 72 745 | 1,798 2 
30 176 384 0 
di, (mod 11), c” =0, =1, =10 
€11 =€22=e33=0 (mod 11), where C= 
¢31=5, ¢32=7 (mod 11) 
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A GENERAL ASYMPTOTIC FORMULA 
FOR PARTITION FUNCTIONS 


NELSON A. BRIGHAM 


1. Introduction. Practically all partition functions that have been 
considered have generating functions of the form 


1 


where the real part of s is positive and 


(2) F,(z) = te y(n) ~ Kx* log? x, y(n) 20,K>0,u> 0. 


Here a,(0)=1 and for k positive 
a,(k) 
(3) = ll v(m) + 1] + 


! 
km n=l k,,! 


Thus the partition k=1k,+2k.+ - - - +Jk; of k into ones, twos, 
and so on, is weighted in a certain way by the product shown, and 
various partition functions are obtained by choosing the function 
() appropriately. For example, if y(”)=1 for all m, then a,(k) be- 
comes the unrestricted partition function; if y(m) =m for all m, then 
a,(k) becomes the plane partition of Wright [2];! if y(~)=1 for rth 
powers and 0 otherwise, then a,(k) is the number of partitions of k 
into rth powers; and so on. 

The purpose of this paper is to establish the asymptotic formula 


1 
log A { Kul'(u + 2)¢(u + 1) } 1/(u+1) 


(4) 
{ + 1} /(ut1) pul Jog 
where 
k 
(5) A,(k) = a,(n). 


n=0 


We prove (4) by a method used by Hardy and Ramanujan [1] to 
obtain some special cases thereof, namely, we first prove that for s 


Received by the editors December 8, 1948. 
* Numbers in brackets refer to the references at the end of the paper. 
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tending to zero through positive values 


1\* 
(6) log g,(s) ~ x(—) (log T'(u + 1)¢(u + 1), 


and then apply a Tauberian theorem proved by Hardy and Ramanu- 
jan in their paper. 
If a,(%) is nondecreasing with increasing & from a certain point on, 
A,(k) + O(1) 
k 


(7) 


S a,(k) A,(k) 


and hence (4) is true for log a,(k). This is the case, for example, if 
y(1) 21. Formula (4) is obviously true for a,(k) under wider condi- 
tions than monotonicity, but on the other hand it is not always true 
for a,(k); for example, it is obviously false if y() is 1 for even m and 
0 for odd n. 

Furthermore:? Suppose that y(m) 21 whenever it is nonzero. Sup- 
pose that the set S of positive integers for which y(m) #0 has the prop- 
erty that all positive integers not less than c can be partitioned into 
integers from S. Then (4) is also true for log a,(k). 

To prove this observe that a,(k) 2a,(m) for k—c2m21. Hence 


A,(k — c)/(k — c) a,(k) S A,(k) fork >c 


and the conclusion follows from (4). 

As a result of this, one can deduce asymptotic formulas for the 
logarithm of any unweighted partition function, provided only that 
the set of integers used is such that all sufficiently large integers can 
be partitioned into integers from the set. In particular, this is cer- 
tainly true if the set of positive integers contains two coprime integers. 

An important lacuna in the literature on partitions is the absence 
of an exact or asymptotic formula for the number T(z) of partitions 
of k into primes. From (4) and the prime number theorem we get 
only 


k 1/2 
8 log T(k) ~ — 
(8) og T(k) 


which is given by Hardy and Ramanujan (formula 5.281 of [1]). 
In the paper following this one we consider a weighted partition 

function a(k) involving primes, namely that obtained by choosing 

(mn) in (1) as the A(z) of prime number theory. We obtain there an 


2 The author is indebted to the referee for this observation, which materially ex- 
tends the scope of the paper. 
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asymptotic formula for A(k)= )-%-, a(m) under the assumption of 
the Riemann hypothesis. From (4) follows the weaker result 


(9) log A(k) ~ 2(2k/3)'/? 


without any unproved hypothesis. Since proof of monotonicity of 

a(k) would seem to be extremely difficult (if true), we cannot assert 

(9) for log a(k). Note that (9) is the same as the asymptotic formula 

for log p(k), where p(k) is the number of unrestricted partitions of ( 
k. The correspondence follows from the fact that 


> A(n) ~x~ 1. 


naz 


Since also 


log p~ x, 
paz 
the powers of the primes may be disregarded in (9). 
Besides (8) and (9) we give some further applications of (4) at the 
end of the paper. 


2. Proof. In the following s is real and positive. Both “o” and “O” 
refer to s tending to zero. 
From (1) we obtain 


1 
log gx(s) = v(m) log ——— = — ¢(ms) 
n=1 m=1 ™ 
where 
= 


n=1 


We find an asymptotic formula for ¢(s) for s tending to zero, and 
use it to find an asymptotic formula for log g,(s). This is the previ- 
ously mentioned technique of Hardy and Ramanujan [1, pp. 245- 
261]. 


+ y(2)e-** + + + - 
— {ert — — {y(1) + 7(2)} — 
— {y(1) + (2) + ¥(3)} — et} 


¢(s) 


-- fa 
q=l n=l 
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= f F,(y)se~*¥dy -f F, (=) e-*dx. 
1 
Thus 
s gift 


F,(x) is an increasing function, hence in the first integral of (10), 
% 1 1 1 

F,(=) log? —, fors<—, 


and so 


1 
| f F, (=) e*dx = 0f log” for s tending to zero. 
8 s s 
s may be taken so small that, in the second integral of (10), 
x\° 
where | pr(x) | <e uniformly in x, x2s/?. Then: 


1 
= 0f log’ 
x u x 
+ (1 + o(1))K (=) (log =) e-*dx. 
az \s 
Put 


= (=) (tos = = $,(s) + &,(s). 


For ®,(s), —1/2Slog x/log (1/s)$1/2 and 


=(1 ~) (14 log x 
s log (1/s) 


log (1/s) 


where | p2(x)| <C,. Therefore, 


(11) 
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= {—} (log — = 
1 u 1 v oo 1 u 1 v 2 V2 
= (—) (tog -) f x“e—*dx — (-) (ios f 
0 AY AY 0 
1 v 
(10g -) f 
s 1/si/2 


i] 1/2 


u 1 v1 I/s 
) log -) f x“(log x)po(x)e-*dx. 


1\* 1\? 1 
,(s) = — log —} +1) +0 log? — 
s) 
1 u 1 v1 
+ of1) — — 


Also, =0(1). 
Substituting ®(s) = $,(s)+4.(s) into (11), we have 


¢(s) = log? 
+ {1+ o(1)} {«(—) + 1) ¢ 


+04 log? + o(1 ) 


$(s) ~ (=) (08 +1 


1 
log gx(s) = 2) —o(ms) = 


M<m<i/s¥/2 1/s/2S mS 1/s m>1/s 


= + + o3(s) + ¢y(s), 
where M is such that }\n»41/m*t!<e (M(e) is fixed). 


Hence 


Now 


| 
| 
| 
1 
We find: 
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First 
M 1 
= — o(ms) 
m=1 
1 


— {1+ o(1)} {x(—) (10e —) + 
ms ms 


But log m/log (1/s) <1/2 for ¢:(s) and 
log m \* 
(tog —) (10g log 
-( ~) (1+ log m R. 


where | R,(m)| < C,. 


Thus 
u M 
oi(s) = x(-) (tog T(u+1)>> = 
u v1 M 
+ x(-) (108 —) rut {1 + 0(1)}. 
AY m** 


But M is fixed, and so 


1 u 
= x(—) (log ~) P(u + 1)¢(u + 1){1 + 0(1)} 


| Secondly, for ¢2(s) we have 


1 \* 
| o(ms) < (—) (toe —) 
ms ms 


and since M<m<1/s'/2 


1 v 1 v 
(10g —) s (tog -) for all v. 
ms 


| 
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Thirdly, for ¢3(s) we observe from the original definition that 
o(m, s)<(mzs) if m,>m2. Hence we may use for all m the estima- 
tion for the smallest m, namely m=1/s"/?, and find 


1 u/2 1 i/e 1 
os(s) < Cus (—) (10g > 
1 ™ 


Finally, in ¢4(s), ms>1, and the definition of ¢(ms) readily yields 
¢4(s) =O(1). See (3.544) of Hardy-Ramanujan. 
This gives 


or, 


1\* 1\* 
log g,(s) = x(— (ios —) T'(u + 1)¢(u + 1){1+ o(1)} 


+ 


and we have 


(6) log g,(s) ~ K (=) (oe + 1). 
AY 


Now we quote, for reference, Hardy and Ramanujan’s Theorem 
A [1, p. 252]: 
“Suppose that 


An 
Ar & 0, An > An —1;4,2 0,4 >0,a >0; 


n—1 


D2," is convergent for s>0; 


— | 
(2) 
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f(s) = = exp| {1 + o(1) } {iog—\ "| 
when s—0. Then 


= exp [{1 o(1)} Bas (log 


Applying this theorem with: 


f(s) = gy(s), A = KT(u+ 1)f(u+ 1), 
A =n= k, B= 
a= 


we obtain (4) at once from (6). 


3. Further applications. A few applications were indicated in the 
introduction. 

As a matter of interest we give some further applications. 

A fairly general type of partition is obtained by using only com- 
ponents of the form: 


(12) m= do + aym + +--+ + am’, 


where the a’s are such as to give positive integral m for positive inte- 
gral m. For simplicity we also assume the polynomial to yield m,>-2 
for m,>mz2. The m’s themselves may be restricted in various ways. 
Let m, be the largest integer which gives m Sx in (12). Then 
rl 
dot + am 
Sx + + + a,(m + 1)". 


As x—> ©, m,— ©, and so for x tending to infinity we have 


r t— r 1 r 
x = a.m, + O(m, = (1 o(—)) = a,m,(1 + o(1)), 


my, 
x l/r 

m, = (=) (1 + o(1)). 
a, 


Let m be restricted in some manner such that 
(13) xmy(log m) p> 0. 
ms 


Obviously, 1 <1. But when only the restricted m’s of (13) are used 
in (12), 1= > 1. Thus: 
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K alr 
(14) (log x)’{1 + o(1)}. 


We put y(”) =1 when z is of the form (12), and y(n) =0 otherwise. 
Comparing (14) with (2) and (4), we have the theorem: 

If A,(k) is the summatory function for the number of partitions 
of k into components of the form 


= Ay + aym + aem? + --- + am’, 
Nn, > neo for m, > mo, 
where > 1~xmi{ (log m1)’, then 


log A,(k) ~ pel (rte) (u—rv) / (r+) 


r/ k ul (r+u) 
r Tr a, 


We apply this theorem to the following cases: ( 
(A) Let m be unrestricted. Then x=1, u=1, v=0 and 


1 1 r/(r+1 f ) 
log A,(k) ~ (r + + 2)3(— + 
r r ay 


Hardy and Ramanujan’s result for partitions into rth powers of 
positive integers [1, p. 259], is a special case of this. (Wright [3] gives 
more detailed results.) 

(B) Let m be restricted to primes. Then kx=1, n=1, y= —1 and 


1 1 r/(r+1) 
log A,(k) ~ (r + »{r(— + 2)s(— 
r r 
k 1/(r+1) \ 
a, 


Again, Hardy and Ramanujan’s result for partitions into rth powers 
of primes [1, p. 260] is included as a special case. 
(C) Let m be square-free. Then and 


log A ,(k) 


6 1 1 ri(r+1) ¢ B 1/(r+1) 
~(rt+ r(— + 2)3(— + 
r r dy 


In particular, for partitions into square-free numbers: 


log a,(k) ~ 2k}/?. 


f 
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(D) Let m be a polygonal number of order g+2 (¢21): 
m = 1+ — 1)/2; 


mm; q 
and 
log A,(k) ~ (27 + 


2\1/2 1 1 2r/(2r+1) k 1/(2r+1) 
q 2r 2r a, 


In particular, for partitions into triangular numbers: 


3 Gx) 3 
0g ~ (6r) ‘(> 


(E) Various combinations may be taken, with the answer of higher 
order dominating. For instance, partitions into square-free numbers 
and triangular numbers, with duplications omitted or not, as desired, 
gives the result: 


Then 


log a,(k) ~ 2k}/?, 
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ON A CERTAIN WEIGHTED PARTITION FUNCTION 
NELSON A. BRIGHAM 


1. Introduction. In the introduction of the preceding paper we re- 
marked that no one has given an asymptotic formula for the number 
of partitions into primes. As a step in that direction we now give, 
under the assumption of the Riemann hypothesis, an asymptotic 
formula for the summatory function A(k) of a certain weighted par- 
tition function a(k) involving primes and powers of primes. 

As pointed out in the introduction of the previous paper, the gen- 
erating function of the type of partition we have in mind is given by 


1 
(1) g,(s) = Il a,(k)e-**, 


ami (1 — k=0 


where y(n) 20, R{s]>0. Using the Mellin formula for e~* we get from 
1 


(2) log gx(s) = — 


1 3/2+008 y(n) 


If y(”) has been chosen in such a manner as to yield an “amenable” 
Dirichlet series, the calculus of residues will give more or less useful 
information concerning g,(s). For example, if y(~)=1, then a,(k) 
= p(k), and the method leads directly to the usual transformation 
formula for the generating function of the unrestricted partition 
function. If we consider the weighted partition function involving 
primes and powers of primes obtained by choosing y(”) =A(n), we 
have 


and the properties of the latter function are well known. For this 
choice of (m) we drop subscripts on a,(k) and g,(s) and write 
A(k) = a(n). 
nk 

If we were to choose () as 1 if m is a prime and 0 otherwise, then 
a,(k) would be exactly the number of partitions of k into primes, but 
we would have to deal with the less tractable function >> p~. 

Using (2) with y(m)=A(n), we obtain by means of the calculus of 
residues an asymptotic formula for g(s) as s tends to O within an 
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angle of the form | arg s| $0.<w/2, namely 


2 
(3) = Kase * exp (= +266) + 0(/51)), 
where 
x? 1 
(4) D-), 
24 ep 
and 


> — p) cai + 


5 Z(s) = = 
ps? sin (xp/2) s? 


the summations being taken over the complex zeros of {(z). We shall 
show that Z(s) =0(| s| —1), so that 1?/6s is the dominant term in the 
exponent of e in (3). Now if we assume the Riemann hypothesis, we 
can show that g(s) satisfies the conditions of a Tauberian theorem of 
Ingham [2]. Applying this theorem, we get 


(6) A(k) ~ exp ( {= + oZ(c) — z'(o)\), 


where K,= (3/27)/*K2/z, and @ is the positive solution, existent and 
unique for & sufficiently large, of 


1 /x? 
\ 6 
As k tends to infinity, ¢ tends to 0 and 
1 (6) 1/2 
oZ(c) 0, o°Z'(c) > 0, —~ 


in conformity with (9) of the previous paper. In contrast to (9) of 
the previous paper, the powers of primes make an essential con- 
tribution to (6). 

In order to get an asymptotic formula for a(k) we would need to 
apply Ingham’s theorem to 


tak) — a(k — = (1 — *)g(s). 
k=0 
However, the use of Ingham’s theorem would require that we know 


that a(k) is nondecreasing for large k, which seems difficult to estab- 
lish (if true). 
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In the Hardy-Ramanujan Farey-arc method [1]', one would con- 

sider 
Il 1 
h(x) = 

(1 en) Atm) 
in the neighborhood of its singularities at the points e?***/*, where 
a and 6 are integers. In our case we have x=e~*, R[s|>0, and con- 
sider the approach of s to zero. This corresponds to considering only 
the most important of the singularities, namely that at x=1. 


2. The asymptotic formula for g(s). Taking the logarithm of (1), 
with ~(m)=A(m), we have 


o 
log g(s) = 


n=1 m=1 


where s is complex, | s| S1, |arg s| S0)<7/2, and log g(1) is real. 
Using the Mellin transform, 


— 
2rid 


we find 
1 Tis) Ale) 2. 1 
log g(s) = 2, =; ds, 
or 
—1 + 1) 
(8) log g(s) = — 


J s* ¢(z) 


Now we consider 


where the integral is taken around the rectangle 3/2 —T7,i, 3/2+T7,i, 
—3/2+T,i, —3/2—T,i. T, [3, pp. 340-341] is an ordinate between 


the positive integers g and g+1 upon which ¢(z) has no zeros, and 
for which 


+ T,i) 
f(a + T ,i) 


3 
2 


(10) 


IV 


3 
<C 


1 Numbers in brackets refer to the references cited at the end of the paper 
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Note: We put z=x+-yi, since we later wish to follow Ingham [2], 
who uses s=o+/i. We then put y=T, for the particular set of 
ordinates in question, following Landau [3]. 

We estimate the integral over the end segments (3/2+T7,i, 
—3/2+T,i) and (—3/2—T,#, 3/2—T,#). On these segments we have, 
in addition to (10), |¢(z+1)| SCT? [5, p. 276]. For fixed x, and 
y— + ©, it is easy to show that (cf. [4, Satz 299]) 

| T(z)| S 
Putting (s fixed) 
(11) = re 1, S 9% < 
we have |s~*| <Ce™, 

Thus the absolute value of the integral (9) over the end segments 
is less than or equal to 


C(e-*7e!?T ,)(T (log? T,) C exp (T,(80 — 2/2) + 4 log T,). 


With 7/2—0@,=0,>0, we may absorb the 4 log T, by using as an 
exponent @,T,/2. We obtain as our final estimation, 


6>0, T,>2. 


If we denote the sum of the residues within the rectangle of integra- 
tion by 27,, (9) becomes 


J 3/279: s* ¢(z) 
—1 p-3/2-Tei T(z 1) 
(12) + —f + 1) dz + = 
J 3/247 9i 


Next we compute 2r,. For this purpose we alter the integrand of 
(12) by means of 


(13) ¢(1 + 2) = 2(2n)t(—2) cos [3, p. 285], 

and 

(14) = [S, p. 239], 


which gives 


w(2m)* 1 


il 
| 
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At z=1, z= —1, and at each of the complex zeros of ¢(z), (15) has 
a simple pole. (15) has a double pole at the origin. 
We compute the residues at the simple poles first. 


Since 
1 
¢(z) z—1 
the residue at z=1 is 
1 
(16) —#(2x) — §(—1) = — - 
6s 


The sum of the residues at the complex zeros of {(z) is 
p(sin (xp/2)s? 


(17) 


where p=8+7i runs over the complex zeros of {(z). Each zero is 
counted to its multiplicity. 
Since ¢(z)=1/(z—1)+ ---, the residue at z= —1 is 


1 ¢(-1 
(18) = O(r), r— 0. 


In order to obtain the residue at the double pole at the origin, we 
use the following expansions (P(z) denotes a power series convergent 
for |z| small enough) 


(=) =1+2 log — + 2?P(z), 


1 2 
— 
1 1 
- <(1 - + log? 2x — + 2°P(z), 


1 
= - log 2x 


1 1 
+ log? 2x — >> + 


24 


— 
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1 
f(z) 


The expansions for {(z) come from 


= — 2+ 2(log 2x)z + 2*P(z). 


and 


where b=log 2r—1—7/2, y being Euler’s constant. 
We find that the residue at z=0 is 


24 


Inserting (16), (17), (18), and (19) in (12), and allowing T,- ~~, 
we obtain from (8) 


log g(s) = — f ds+—+Z 
og g(s) + + Z(s) 


1 
(19) log 2x logs -1+—+ >> 
p? 


wt 


(20) 
1 

+ log 2x logs -1+—+ 
24 


where Z(s) is given by (5). 

In the integrand of the absolutely convergent integral which 
appears in (20), (11) gives | s~=| =r3/2e-%, Thus the integral is 
O(r*/?) = O(r) for r tending to zero, and so we get the transformation 
formula 


1 
(21) log g(s) = —+2Z(s) + log 2x logs —1+—+ —+O(r). 
6s 24 


This gives (3) at once. 

Since 1/(1—e-*") =(1/sm) {1+0(1) } as | s]| 0, each factor of (1) 
(with y(m) =A(m)) for which A(m)#0 makes a contribution of impor- 
tance to (3). Thus the powers of primes make an essential contribu- 
tion to the transformation formula, and hence ultimately to our result 
for A(k). The “size” of this contribution is readily seen: Write 


1 1 
a(s) = gi(s)g2(s) = (I (IL. 
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and since ) p=<z.mz2 log p~x"?, (6) of the preceding paper applied 


to gi(s) and g2(s) separately gives 


wz? 73\ 1 
(22) g(s) = ep (= (14 += 


Comparison of (21) and (22) shows that lim,.orZ(r)=0. For 


future use we prove more, namely 


(23) lim r*+1Z()(s) = 0, 


r—0 


where Z“(s) means the nth derivative, 


Using the first form of Z given in (5), we have, with p=6+~7i, 


| | ¢(—p) | | + ) (p 1) | 
p |p| -| set»| -| sin xp 
But 
= | +1)--- +n-—l1 
| | - | ple +1) = (5, p. 276], 
| 
and 
1 1 1 C eolrl 
while 
1 
Cg 
| sin (rp/2) | 
Using 7/2—0)=8, as before, we have 
| Z()(s) | <Cr 
r 
But 
8s 1--——» |3, p. 321], 
log 
and so 
eAlri/2 


* The second factor of the numerator is to be taken as 1 in case m is zero. 


| 
| 
| 
| 
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| 


1950] ON A CERTAIN WEIGHTED PARTITION FUNCTION 199 


It is well known that the number of complex zeros of {(z) with 
imaginary part between T and T+1 is O(log T) and the number 


with imaginary part less than or equal to T is O(T log T) [3, pp. 337- 
338]. Hence 


| Z ™)(s) 


< C(T log + C 


n=T 


IA 


log? T + log T log log T + a logr 
C exp ) + Ce 
log T 


Put T= —log r and both terms —0 as r—0. This establishes (23). 


3. Application of Ingham’s Tauberian Theorem. We see at once 


that 
g(s) = f e“*dA(u) = sf e-“*A(u)du, 
0 0 


where A(u) =A(k) for kSu<k+1, and we now are in a position to 
apply Ingham’s theorem [2]. Since it is an integral part of what fol- 
lows, we quote the relevant portions of this theorem in their entirety. 
All statements apply for o sufficiently small (or w sufficiently large). 
Open segments: (a, b); closed segments: [a, b]. 

We quote: “Let there be given two functions ¢(s) and x(s), and a 
domain D of the s-plane, satisfying the following conditions: 

a.) $(s) and x(s) are regular in D, and real and positive on a seg- 
ment (0, i] of the positive real axis lying in D; 

b.) —o@’(c) tends monotonically to infinity as o tends to0---; 


5(c)/o 
c.) tends to as a tends to 0, 


where 5(c) is the distance of the point o from the complement of 


d.) =0{¢"(0)}, 
x(o+z) =O{x(c)} 


uniformly for |z| <4(¢) when o tends monotonically to 0. 
Define 


= x(s)e*™, Fo(s) = 


| 
| 
— 
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where o =<, is the solution [existent and unique for sufficiently large 
w, by (a.) and (b.)] of 


Ao(w) = 


—¢'(c)=w4 (O<oS h). 


In these circumstances we have: Suppose that the integral 
f(s) -f e-“*dA(u), A(0) = 0, (s = a+ it) 
0 


is convergent for o>0, and that 

(i.) f(s)~fo(s) when s—0 in D; 

(ii.) f(s) =Of{fo(|s|)} when s—0 in some fixed angle ‘A’ of the 
form |¢| <Ac, (0<A< 

(iii.) A(u) is increasing (in the wide sense) for 120. 

If (ii) holds for every fixed A, then 


A(u) ~ Ao(u) as 
We note that Ingham’s theorem has A(0) =0, whereas it has been 


convenient for us to write A(0)=1. This does not affect the result. 
We desire to apply Ingham’s theorem with 


x(s) = 2r 


and 


¢(s) = bs + Z(s). 


It must therefore be established that the conditions of Ingham’s 
theorem are satisfied by our functions. 

For D we choose the region bounded by the rays |a| =7/4 and the 
arc | s| = 1. 6(c¢) becomes merely the distance from ¢ to the boundary 
of D. From a figure we see that: 


= 21/2¢/2, o 1/2. 


Hypothesis (a): 

o(s) and x(s) are regular in D. 

x(s) is real and positive on the segment (0, 1]. 

In virtue of (23) and the fact that the complex zeros of {(z) occur 
in conjugate pairs, ¢(s) is also real and positive on a segment (0, /] 
of the positive real axis for h sufficiently small. 


| 
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Hypothesis (b): 
2 


~ — ofZ'(0), 


—2?/6 — — 


2 


—o¢'(c) 


ll 


ll 


(—a¢'(c))’ 


Using (23), we find that the slope of —o¢’(c) is negative for suffi- 
ciently small ¢. Hence —o@’(c) tends monotonically to » as ¢—0 in 
the interval (0, h] for sufficiently small h. 

Hypothesis (c): /z=2"/?/2 while 

{ 2 { 2/3 + } 1/2 
= gilt —0 
—¢'(c) a? /6 — o°Z'(c) 

as ¢—0 by (23). 

Hypothesis (d): We see from a figure that +2 lies within a circle 


whose center is o and whose radius is 6(¢) =((2)/?/2)o0, o<1/2. 
Thus, 


(24) (1- <lots|<(1+ (e small enough). 


So 
| x(o + 2)| < Cx(c) uniformly in z. 


Putting the first inequality of (24) into |o’’"(o+2)| we have, from 
(23), 


| + z)| S C/o’. 
On the other hand, using (23) again, we obtain for sufficiently small ¢ 
> 


Thus: uniformly in z. 
Hypothesis (i): This has already been established. See (21). 
Hypothesis (ii): We must show that 


| g(s)| Cx(re*™, 


s—0, for every fixed 6 of (11). This is equivalent to proving that 


slog 2r exp (= + z(s)) 


2 
S exp (= +Z(r) + c). 
r 


The problem thus reduces to finding a C, independent of s, such that: 


—— 
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3? x? 
= zs) | —+ R[Z(r)] +. 
6s 6r 
Clearly, R[x?/6s]=2? cos 6/6r, see (11). 
Now we analyze: 


(5) Z(s) = 
In the first place, 
= = {cos (88 + log r) — i sin (80 + log r)}. 
Put 
(25) + p)I(e) = + i¥(o) = X + Yi. 


This gives 


1 
R[Z(s)] = >> { Xe” cos (08 + log r) + Yer sin (08 + log r)}. 


Thus we must show that 
1 
{ Xe” cos (68 + log r) 
r 


+ Ye sin (08 + y log r) 
— X cos (y log r) — Y sin (y log r)}. 


Now 1—cos 62/4 for |a| S7/2. If we expand cos (@8+¥7 log r) 
and sin (@8+v7 log r), and collect terms, it will be sufficient to 
show that: 


—+C2z> > { [X cos (y log r) + Y sin (y log r)]- [e” cos 08 — 1] 


+ [Y¥ cos (y log r) — X sin (y log r) | [e sin 98]}. 
But cos 68—1=(e7—1) cos 08+-(cos 68—1), and for all real x, 
|e=—1| Also 
a| 
| cos 98 — 1| = tutes s 
2 2 4 


Thus |e” cos 68—1| /4, while |sin 08] <|@|8<|@]. 
Hence it will be sufficient to show that 


| 
\ 
| 
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| 
+ 
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| 


Or, with all 8<fo, 1/2<BoS1, since |y| >2 for all y, 


—+C2 


292 | 0 


Referring to the definition of X and Y in (25), we see that the 
above series converges for all fixed values of 60, 90<a/2. The prob- 
| lem finally reduces, then, to finding a C; such that 

‘ 2K | 6| 
(26) —+C,2 —— where K; is a numerical constant. 
T 0 


This is equivalent to 


| 1 2 
( -— K3;} 20 with Bo = and"C, = 


The full Riemann hypothesis is essential for our method, since (26) 
can be written 


1 2 K; 
(27) +C:-—— 20. 


280-1 \ 


IV 


Unless 28)—1=0, no C; can be found which will satisfy (27) for all 
values of s simultaneously. For if Bp =1, we could choose @=r'/?, and 
then find the contradiction that (K}—(K3;—r?)*)/r is bounded. If 
) 1/2<Bo<1, we could choose @ as K3r'~* for sufficiently small r, and 
then we would have the contradiction that K;/r*- is bounded. 
Hypothesis (iii): This is assured by the definition of A(u). 
Since the conditions of Ingham’s theorem are satisfied, we have 


exp (x?/60 + Z(c) + ko) 


28 A(k) ~ 
(28) ( ) { + Z’"(a)) } 1/2 


where (7) holds. 
Because of (23), the actual contribution of the denominator is 


1/2 30° ” (3)*/2 
(29) (1 +22 ()) 
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Substituting (7) and (29) into (28), we obtain our final result, (6), 
where 


(6)/2 


1 
= exp(-14+ 5 + 
24 p 


(2) 


Ky 


and where ¢@ is the solution, existent and unique for ¢ small enough, 
of (7). Z(s) is given by (5). 


The author wishes to express his gratitude to Professor Hans A. 
Rademacher of the University of Pennsylvania for suggesting this 
problem and for his kind assistance and constant encouragement. 
This paper is presented in partial fulfillment of the requirements 
for the degree of Doctor of Philosophy at the University of Penn- 
sylvania. 
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ON FAITHFUL REPRESENTATIONS OF LIE GROUPS 
HARISH-CHANDRA 


Let G and H be two connected Lie groups and ¢ a continuous 
homomorphism of H into the group of automorphisms of G. Then we 
define a new group GX,4H as follows. The elements of GX,H are 
pairs (g, h) (g©G, hGH) and group multiplication is defined by 


(gi, 41) (ge, 42) = hike). 


Topologically GX,H is taken to be just the Cartesian product of G 
and H. It is then easily proved that under this topology GX,4H is a 
Lie group. It is called the semidirect product of G and H under ¢. The 
object of this note is to prove the following theorem. 


THEOREM. Let G be a connected, simply connected solvable Lie group 
and H a connected Lie group which has a faithful representation. Let 
be any continuous homomorphism of H into the group of automorphisms 
of G. Then GX4H has a faithful representation. 


The special case of this theorem when H is semisimple is due to 
Cartan.! 

Let R be the field of real numbers and K the field of either real or 
complex numbers. Let G be a connected Lie group with the Lie alge- 
bra g and @ a representation of G over K of degree d. Then we denote 
by d@ the representation of g given by? 


6 
10 t 


where ¢€ R and J is the unit matrix of degree d. Let GL(K, d) denote 
the group of all nonsingular matrices of degree d with coefficients in 
K. Any subgroup of GL(K, d) will be called a linear group of degree 
d. Let @ be the identity representation of a linear Lie group G with the 
Lie algebra g so that 0(x) =x (x€G). Then dé is a faithful representa- 
tion of g and exp d0(X)=@(exp X)=exp X for any x€g. Hence we 
may identify g with d@(g) under d@. g can therefore be regarded as a 
linear Lie algebra. In particular the Lie algebra of GL(K, d) then 
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consists of all matrices of degree d with coefficients in K. We denote 
it by gl(K, d). Given any subalgebra }CgI(K, d), by the linear Lie 
group generated by § we mean the analytic subgroup of GL(K, d) 
corresponding to b. 

Let us call a matrix subtriangular if it has zeros on and below the 
the diagonal. First we state the following two well known lemmas. 


Lemma 1.* Let Rt be the Lie algebra of all subtriangular d Xd matrices 
over K and let G be the linear Lie group generated by N. Then X—>exp X 
is a topological mapping of N onto G. Also G consists of all matrices 
which have zeros below the diagonal and 1 everywhere on the diagonal. 


Lema 2.‘ Let G be a connected, simply connected solvable Lie group. 
Then every analytic subgroup of G is closed and simply connected. 


From now on I adhere strictly to the notation of my paper, Faith- 
ful representations of Lie algebras,’ which will be quoted as FRL. 


Lema 3. Let 2, MN, D be as in Lemma 1 of FRL. We construct the 
faithful representation @ of 2+D as described there. Then for any 
Yi, Y,E2 and Dy, eee, D,EDd, 


(1) exp 0(Y:)--- exp 0(Y,) I unless exp Yi --- exp Y, = I”, 
(2) exp 0(D,) --- exp 0(D,) = I 

if and only if exp D, -- - exp D, = I’, 
where I, I', and I" are unit matrices of suitable degrees. 


If we use the notation of the proof of Lemma 1 of FRL, (1) follows 
immediately from the fact that where X¥* =¥/X. Now 
we prove (2). Put w*(X)=(w(X))* for XE&. Then it is easily 
proved by induction on s that for any DED 


{0(D) } *w*(X) = w*(D*X), s 


Hence (exp 0(D))w*(X)=w*((exp D)X). Also since 0(D)=dj is a 
derivation of %*=%/X, exp @(D) is an automorphism of A*. Put 
Yx=(exp D,) - - - (exp D,)X (X EX). Then if 


exp 0(D;) - - - exp 0(D,) = I, 


w*(Yx)—w*(X)=0 for every XE. Hence w( ¥x—X)CXC#k. 
Therefore mw( Yx—X) = Yx—X=0. Since this is true for every X, 


IV 


* Birkhoff, Ann. of Math. vol. 38 (1937) pp. 526-532. 
* Chevalley, Ann. of Math. vol. 42 (1941) pp. 668-675. 
* Harish-Chandra, Ann. of Math. vol. 50 (1949) pp. 68-76. 
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(exp D,) - - - (exp D,) =/'. Theconverse is obvious. Hence the lemma 
is proved. 


Lemma 4. Let G be a connected, simply connected solvable Lie group 
with Lie algebra g. Let R be the maximal nilpotent ideal of g. Then 
G has a faithful representation y such that d¥(X) is nilpotent for 
every XEN. 


By Corollary 1 of FRL we can find a faithful representation po of 
g such that po(X) is nilpotent for every XCM. We can therefore 
choose, if necessary, a new base in our representation space such that 
with respect to this base the matrix representing po(X) is subtri- 
angular for every XCM. Now consider the factor algebra g/N 
which is abelian and hence nilpotent. By the same corollary it follows 
that g/% has a faithful representation by nilpotent matrices. Hence 
g has a representation p; such that the kernel of p; is N and p:(X) 
is nilpotent for all X€g. We can again arrange that p:(X) is sub- 
triangular for all X. Put p=po+pi, where + denotes direct sum. 
Since G is simply connected, there exist representations Wo and y; of 
G such that dfo=po, Put Then dy=p. Let N be 
the analytic subgroup of G corresponding to Jt. Then from Lemma 2, 
N is a closed invariant subgroup. Consider ¥(N). It is clear that 
d¥(Y)=po(Y)+p.(Y) is subtriangular for all YER. Hence from 
Lemmas 1 and 2 it follows that the linear Lie group ¥(N) generated 
by dy(M) is simply connected. Since dy is an isomorphism, (JN) is 
locally isomorphic to N. Therefore since ¥(N) is simply connected, 
¥ maps N isomorphically. Similarly we prove that ¥(G) is simply 
connected. It is clear that the kernel of ¥, contains N. Hence y; de- 
fines a representation ¥* of G/N given by wW*(x*)=y:(x) where 
x—x* is the natural homomorphism of G onto G/N=G*. Since dy* 
is an isomorphism, the kernel of dji=p, being N, it follows from the 
simple connectivity of y*(G*)=y(G) that ¥* is an isomorphism. 
Hence the kernel of y is exactly N. Let D be the kernel of y. Then 
D is contained in the kernel of y which is N. Also since y is faithful 
on N, D(\N={e} where e is the unit element of N. Hence D= {e} 
and y is a faithful representation. Also d)(X) is nilpotent for every 
XEN. 

Now we come to the proof of the theorem. Let g be the Lie algebra 
of G and 9% the maximal nilpotent ideal of g. By Lemma 4, G has a 
faithful representation. Hence we may assume that G is a linear Lie 
group such that every element of & is nilpotent. We keep to the 
notation of Lemma 3 except that & is replaced by g. Let 6 be the Lie 
algebra of H. Define a homomorphism dr of § into D as follows. Let 
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Aut (G) be the group of automorphisms of G. Then Aut (G) is a Lie 
group with a Lie algebra %. It is well known that there exists an 
isomorphism A of &% onto D such that 


(exp A) exp X = exp ((exp A(A))X) 


for any ACY and X€g. We put dr=Xod@ where d¢ is the homo- 
morphism of § into &% induced by ¢. Then for any P;€b, 1S7Sr, 


o(exp P, - - - exp P,) exp X = exp ((exp dr(P)) - - - exp dr(P,))X). 


Let e and e’ denote the identity elements of G and H respectively. 
Suppose exp P; - - - exp P,=e’. Then clearly 


exp X = ¢(exp P; - - - exp P,) exp X 
= exp ((exp dr(P;) - - - exp dr(P,))X). 
Since this is true for every X Eq, 
exp dr(P;) -- - exp dr(P,) = I’. 
Hence we can define a representation t of H by the rule 
r(exp P, --- exp P,) = exp dr(P,)--- expdr(P,) (Pi,---, Pr 


Put =@o0dr. Then dy is a representation of Suppose exp P; - - 
exp P,=e’ (Pi, ---, P-€b). Then 


t(exp P, -- - exp P,) = exp dr(P)) - - - exp dr(P,) = I’ 
and from Lemma 3 
exp - - - exp dy(P,) = I. 
Hence we can again define a representation wy of H by putting 
v(exp P, --- exp P,) = exp dy(P:) --- expdy(P,) (Pi,---, PE §). 


Also since G is simply connected there exists a representation x of 
G such that dx(X)=0(X) for every X Gg. Hence 


x(exp Y,--- exp Y,) = exp --- exp @(Y,) (VY: G1 Sis). 


From Lemma 3 it follows that x is faithful. 
Consider the mapping of defined by u(g, h) =x(g)¥(A). 
We claim that p is a representation. For any PEh and X €g consider 


¥(exp P)x(exp X)(¥(exp = exp dy(P) exp 0(X) exp (—dy(P)) 
= exp 0(D) exp 0(X) exp (—@(D)) 
where D=dr(P). Now for any two elements A, BEgl(K, d), 


| 

| 
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exp A exp B exp (—A) = exp ((exp ad A)B) 


where ad A is defined as in FRL. Since [0(D), 0(Y)]=0([D, Y]) 
=6(DY) for any Y€gq, it follows immediately that 


exp 6(D) exp 0(X) exp (—0(D)) = exp @((exp D)X) 
exp 6(r(exp P)X) 
x(exp r(exp P)X) 
x(¢(exp P) exp X). 


Since any hCH can be written in the form exp P, - - - exp P,, 
P;€b, 1 r21, we get 


¥(h)x(exp X)(¥(h))* = exp X). 


Similarly since every g©&G can be written as exp Yi -- - exp Y,, 
Y;€q, 1S7Sr, r21, we have 


= x(o(h)g). 


ll 


Therefore 


u((gi, 41) (g2, = hike) = x(gib( hr) (hihe) 
= w(gi, (ge, he). 


Since yw is clearly a continuous mapping it is a representation of 
GX,H. By hypothesis H has a faithful representation vo. Define a 
representation v of GX4H by v(g, h) =vo(hk) and put Suppose 
(g, h) belongs to the kernel of Then since hk) =u(g, 2) +v0(h) 
= x(g)¥(h) +v0(h) and since v is faithful on H, h=e’. Hence g belongs 
to the kernel of x. But as x is faithful on G, g=e. Therefore (g, h) 
=(e, e’) and & is faithful on GX.4H. 


ll 


COROLLARY (MALCEV).! A connected solvable Lie group G has a 
faithful representation if and only if G=NA, where N is a closed, 
connected, simply connected invariant subgroup and A 1s a connected, 
compaci abelian subgroup such that N(.\A= {e } ‘ 


Suppose G= NA. For any a€A let ¢(a) denote the automorphism 
of N given by ¢(a)n=ana— (nEN). Then it is easily seen that 
(n, a)—na is an isomorphism of NX,A onto G. Since A is compact, 
it has a faithful representation. Hence by the above theorem it fol- 
lows immediately that G has a faithful representation. 

In order to establish the converse we make use of the following 
lemma which follows easily from the results of Chevalley.* 


| 
| 
| 
| 
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LemMaA 5.° If G is a connected solvable Lie group and N a closed, 
connected invariant subgroup such that G/N is compact, then there exists 


a compact connected abelian subgroup A of G such that G=AN and 
is finite. 


Returning to the corollary, suppose G is linear. Since g is solvable, 
we deduce in the usual way that every element X€[g, g]=Q’ is 
nilpotent and therefore may be assumed to be subtriangular. There- 
fore by Lemmas 1 and 2 the group G’ generated by g’ is simply con- 
nected. Let d be the degree of G and Gp the group of all matrices in 
GL(K, d) which have zero below the diagonal and 1 everywhere on 
the diagonal. By Lemma 2, G’ is closed in Go. However, since Go is 
clearly closed in GL(K, d), G’ is closed in GL(K, d) and therefore in 
G. Let x—>x* denote the natural homomorphism of G onto G/G’ =G*. 
Since G* is abelian, G* = T* V* where T* and V* are connected sub- 
groups, 7* being compact and V* simply connected and 7*/\V* 
={e*}. Let N be the complete inverse image of V* in G. Since 
N/G’ = V* and G’ are both simply connected, N is simply connected 
and G/N&T* is compact. Therefore by Lemma 5, G=AWN where A 
is compact, connected, and abelian and A/\N is finite. Let cGC ANN. 
Then o*=e for some r21. Then o*€ V* and (o*)’=e*. Since V* is 
simply connected and abelian, o*=e*. Hence c€A/l\G’. Since 
X—exp X is a topological mapping of g’ onto G’, it follows that 
o€G’, implies c=e. Hence A(\N= {e}. The corollary is 
therefore proved. 


INSTITUTE FOR ADVANCED STUDY 


6 This lemma was pointed out to me by Dr. G. D. Mostow. 


ON A PROBLEM OF E. CECH! 
J. NOVAK 


Let P be an abstract set and let $ be the system of subsets of P. 
An additive, single-valued set-function u taking $ into $ and satis- 
fying the conditions u(0)=0 and u(M)DM for each subset M of P 
is called a topology in P, more precisely, an additive topology? in 
P. The space P with the topology u is denoted by (P, u) or, briefly, 
P. Instead of u(M) we shall write simply «M. We define: 


wM = M, u'M = uM, = u(ué'M) for isolated ordinals é, 
and 


= U for non-isolated 
The set M is closed if M=uM. $(M) is the least ordinal number & 
for which the set u§M is closed, that is, u!AM=ut+!M. G(P, u) is the 
set of all ordinal numbers ¢(M) where MCP. 

E. Cech® has posed the following problem: What are necessary and 
sufficient conditions on a set H of ordinals in order that there exist a 
topology u in a countable set P for which H=G(P, u)? V. Jarnik* 
solved the generalization of the problem for a set of arbitrary 
cardinal number N,; his topology, however, is not an additive one, 
as it satisfies only the axiom of monotony. The present paper con- 
tains a solution to this problem when z is to be additive; this solution 
is stated in Theorem 1. The proof is carried through for a class of 
cardinals including No. 

First we have to prove the following lemma. 


Lemma. If the space (P, u) contains a non-closed subset, then there 
exists a non-closed subset ACP such that uA =u?A. 


Let M be a non-closed subset. The set uM— M is nonvoid: choose 
xin uM—M and let A = P—(x). By monotony of u. A#uA=P,so A 


Received by the editors November 12, 1948. 

1 This paper was rewritten for the Bulletin; the transcriber takes full responsibility 
for any errors that may have been introduced by this process. 

2 Every set function f satisfies the axiom of monotony: If MC_N, then f(M) Cf(N). 

3 E. Cech, Topologické prostory, Casopis pro Péstovan{ Matematiky a Fysiki vol. 
66 (1937) pp. D225—D264, p. D264. 

4V. Jarnik, Sur un probléme de M. Cech, Véstnik kr4lovské éeské spoletnosti nauk 
(1938) pp. 1-7. 
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is nonclosed but wA is closed.® 


THEOREM 1. Let H be a set of ordinal numbers. There exists an 
additive topology u in a countable set P (different from uM =M for all 
MCP) for which H=G(P, u) if and only if* 

1°. H contains only countable ordinals (that is, ordinals of the first 
and second number classes) including 0 and 1, and 

2°. If a and B are ordinal numbers such that a+B8CH, then BEH. 


Necessity. Obviously 0G H; by the lemma, 1 also is in H. Every 
¢(M) is countable since P can contain no uncountable ascending 
family of distinct sets.” 2° follows from the equality uf**M=yi4+1)y, 
which can be rewritten as u®(u!M) =u**1(u§M). (If P is an arbitrary 
set of cardinal &,, the same necessity proof works with 1° changed to 
allow only ordinals corresponding to cardinals not greater than N,.) 

Sufficiency. This proof will be carried through for cardinals &, 
satisfying the following condition: If R is a set of cardinal number 
N,, there exists a family S of cardinal number N,.; whose elements 
are subsets P, of R, X\<,4:, such that each P) is of cardinal &, while 


(1) Py, (\ P,, has cardinal less than &,. 


Such cardinals do exist; in particular, taking R to be the set of 
rational numbers, well-ordering the irrationals, and defining P, to be 
a sequence of rationals converging to the Ath irrational, shows that 
NS, is such a cardinal. Define M~O to mean that the cardinal of 
M<®,. 

We assume that H is a set of ordinals satisfying 1°, generalized to 
allow ordinals less than w,,:, and 2°. To topologize a set R of the given 
cardinal &,, assume that H is well-ordered by magnitude, H=ap, 
M1, with and AX<w,, ¢Sp+1. Then, by 1°, 
ao =0 and a,=1. For each A<w, and each £ Say let Py; be subsets of 
P, such that 


(3) Py, and — are of cardinal number 

and 

(4) = Pyand = U P,, for non-isolated 
OSn<t 


5 By footnote 2, u is a monotone. 
* The condition 2° is essentially the same as Jarnfk’s condition 3 in the paper cited 
under footnote 4. 


7V. Jarnfk defines =¢(P) =1; therefore according to Jarnik ex definitione 
1GH. 
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Set P,.1=0 and assume for convenience that —1 is less than all 
ordinals. 

For each subset M of R and each A<w, define the symbol (AM) 
to be the least index such that M(\(P,—P,;)~0. Then —15(M) 
Sa, and (AM) =-—1 if and only if M7\P,~0. We now prove 


(5) (A[M N]) = max [(AM), (AN)]}. 


Indeed it follows from the definition that (AA) S(AB) if ACB. 
Therefore (A[MUN])=max [(AM), (AN)]. Suppose that, on the 
other hand, (AM) <(AN); then the set (IMWUN)(\(Pyx—Piaam)~0 so 
that (A[ MUN]) [(AM), AN)]. 

For each A<w, define f,(&) for OSE<ay, fx(—1) =—1, and 
fr(@rx) =Qy. Say that a subset M of R is of kind a if there exist a finite 
set of indices, \i, ---,An, such that M—Uj<,P),~0; otherwise say 


that M is of kind 8. If F is a set of kind a, put 
F(M) = U 


where, for brevity, P,; and fy, have been replaced by P; and f;. If 
M is of kind a, define urM=MUF(M); if M is of kind 8, define 
uM=R. 

If M is of kind a, the set F(M) ts uniquely defined. Indeed, let 
UisnPr;, MCU j<mP,; and take then MMP,,;—Uj<mP,,~0. By 
(1) either A; is a or and =0. Hence every 
contribution to F(M) by a X; is supplied by a y;, and conversely. 

MN is of kind a if and only if both M and N are of kind a. Also 
for sets of kind a we have F(M.UN)=F(M)UF(N). Indeed, let 
MUN—Uis.Py,~0. By (5) the contribution, Piy,ajarUny, of the 
ith term to F(MWUN) comes either from F(M) or F(N). From this it 
follows that u is an additive topology. 

First, u is uniquely defined for each subset of R and MCuM; also 
u0 =O since the void set is of kind a. u is additive for sets of kind @ for 
u(MUN)=F(MUNJUMUN 
Every set M~0 is of kinda and F(M) =0souM=M if M~0. If either 
M or N is of kind 8, u[MUN]=R=uMUuN. 

Since (AP);) =§, (3) and (4) imply that wPy;=Pyg41 for OSE<ay 
and uP);=P, for =a. By transfinite induction the reader can easily 
verify from this and (4) that 


(6) y: = Pye, for E+yn Sa; = Pr for 


It remains to show that G(R, u) =H. By (6) with §=0, n=, it 
follows that 6(Px) =a,. Hence HCG(R, u). To establish the opposite 
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inequality we must prove ¢(M)C4H if MCR. lf uM=u?M, o(M)=0 
or 1, so it is in H by 1°. If uM+u?M, then M is of kind a, foruM=R 
=u?M if M is of kind 8. We note also that if uA =uB and uA¥u’°A, 
from the equalities u1A =u"B, it follows that ¢(A) =¢(B). Hence for 
we have 


uM = U VU K= u( U U K), 


tan 


where K~0. By the remarks above, since uM #u?M, we have 
oM = o( U U K). 
iSn 


By (6) and additivity of u, ¢6(M) =maxis, $(Pia;,) so it suffices to 
see that every $(P,) is in H. However if 7=¢(Px), then by (6) 
£+7=¢(Pro) by 2°, nEH. This completes the proof. 


Brno, CZECHOSLOVAKIA 


ON THE FACTORS AND FIBERINGS OF MANIFOLDS 
GAIL S. YOUNG, JR. 


1. Introduction. In this note I propose to discuss first some of the 
properties of factors of topological manifolds, possibly having 
boundaries. Particular emphasis is placed on the 1- and 2-dimen- 
sional factors, which are shown to be manifolds themselves, thus 
generalizing slightly results of Borsuk [2].! These results are then 
applied to fiber bundles, and to the question as to whether £*, 
Euclidean n-space, is a fiber bundle with a compact fiber. As a pre- 
liminary, I prove an elementary theorem which asserts that if a fiber 
bundle is connected and locally connected, then the components of 
the fiber are all homeomorphic, and which apparently has escaped 
notice. From this, the other theorems of this paper, and known results, 
follow easily several results concerning impossible fiberings of E*. In 
particular it is shown that, except for possibly a finite set as fiber, 
there is no compact fibering of E* or E‘, or of E5, one possibility ex- 
cepted. 

Many of the proofs are modifications of arguments due to Borsuk 
[2] and Montgomery and Samelson [7]. Several conversations with 
Samelson were helpful and stimulating. 


2. 1- and 2-dimensional factors of manifolds. The first two theo- 
rems of this section are extensions of Theorems 3 and 13 of Bor- 
suk’s [2] and lean heavily on his arguments. The ingenious use of 
the Brouwer theorem on invariance of domain in Theorem 1, orig- 
inating with Borsuk, was also discovered practically simultaneously 
by Szumbarski [10], and used for a similar purpose. 


THEOREM 1. If A XB is an n-manifold? (possibly with boundary) and 
some 0-dimensional subset of A separates an open subset of A, then A is 
homeomor phic to a connected subset of a simple closed curve. 


ProoF. We recall that since A XB is locally connected, so are A and 


Presented to the Society, April 17, 1948, under the title On product manifolds and 
fiberings; received by the editors November 20, 1948. 

1 Numbers in brackets refer to the bibliography at the end of the paper. 

2 A connected space M is an n-manifold if every point of M has a neighborhood 
which is homeomorphic to Euclidean n-space E*, or to the set of all points in E* 
whose nth coordinate is non-negative. The set of all points in M not lying in neighbor- 
hoods of the first kind constitute the boundary of M. We use A XB for the topological 
product of two spaces. 
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B. Let D be a connected open subset of A, and suppose that the 0-di- 
mensional subset T of D separates it. By a consequence of local con- 
nectivity [12; III, 1.4], we can assume that T is closed in D, so that 
T XB is closed in DXB and separates it. Since no subset of an m-mani- 
fold of dimension less than m—1 separates any open subset of it, 
dim TXB2n-—1. No point of T is an interior point, so that neither 
is any point of TXB. Hence dim TXB#¥n [5; Theorem IV, 3], and 
must be m—1. But, by [5; Theorem III, 4], dim BSdim TXB 
Sdim T+dim B=dim B, so that dim B=n—1. If A is not homeo- 
morphic to a subset of a circle, A contains a simple triod ax\/bxUcx. 
Since ax is 1-dimensional, dim ax X B =n, by [6], and hence there is a 
point (uz, v) in ax XB interior to this set, but not lying in the boundary 
B(AXB) of the manifold AXB. There is a homeomorphism 
h: ax—axb, sending u onto x, which induces a homeomorphism 
h’: ax X B—axb XB, defined by h’(p, g) =(h(p), g). By the Brouwer 
theorem on invariance of domain, h’(u, v) =(x, v) is in (Int axb XB) 
(\[A XB—B(A XB)]. But (x, v) is a limit point of (cx —x) XB, giving 
a contradiction. 


THEOREM 2. Let A XB be an n-manifold, with dim A =1, 2. Then A 


1s a mantfold, possibly with boundary, though having a boundary only if 
AXB does. 


Proor. If dim A =1, that A is a manifold follows from Theorem 1. 
To show this if dim A =2, we need only show that A has no local cut 
point, and that given a point a in A, there is an open set V containing 
a such that every simple closed curve in V separates A, by [16; 
Theorem 1.1]. The first condition follows from Theorem 1, and the 
second condition can be obtained from Borsuk’s argument for his 
Theorem 13, where he proves the same thing for a manifold without 
boundary. Only minor modifications are needed to take care of the 
complications introduced by the boundary. 

If A has a boundary, then B X6(A) is an —1)-dimensional 
closed subset of A XB separating no open set, which is impossible if 
B(A XB) is empty. This last remark also proves that no manifold with 
boundary is a factor of a manifold without boundary, which must be 
well known. 

M. Szumbarski has proved in [10] that every 1-dimensional factor 
of a closed n-cell, I", is a 1-cell, and that every factor of J", for 
1sn33, is itself a cell. In the second of these statements, if J” 
=A XB, and nS3, then dim A, dim B S2 (neglecting the trivial case 
of A or B being a point). Hence both of these propositions are im- 
plied by the following corollary of Theorem 2. 
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COROLLARY 2.1. Every 1- or 2-dimensional factor of a closed n-cell 
ts itself a closed cell. 


Proor. If "=A XB, each factor is homologically trivial, by the 
Kiinneth formula. Hence if dim A =1, A is a closed 1-cell, since it 
cannot be a 1-sphere. If dim A=2, A must be a 2-manifold with 
boundary, since no compact 2-manifold without boundary is homo- 
logically trivial. But the only homologically trivial 2-manifold with 
boundary is the closed 2-cell. 

Quite possibly every factor of a closed -cell is itself a cell. How- 
ever, J. H. C. Whitehead has found [11] two non-homeomorphic 
2-manifolds with boundary, such that the product spaces obtained 
by multiplying each with an interval are homeomorphic, so that such 
a conjecture does not carry overwhelming conviction. 


3. Factors with retraction properties. In this section I show that 
no manifold without boundary has as factor any set of a rather im- 
probable type. 


THEOREM 3. If A XB is an open subset of a closed orientable n-mant- 
fold M (without boundary), and A 1s compact, then no nowhere dense 
subset of A is a deformation retract of A. 


Proor. Suppose that the theorem is false. Then A contains a 
nowhere dense set R which is a deformation retract of A. If U is a 
small open subset of B with compact closure, then A X8(U) not only 
separates A XB, but also separates M, as AXU is compact. Now 
Wilder has proved [12, Theorem VII, 3.9] that if, in M, the (n—1)st 
Betti number of a closed subset T, pat(T), is k, then M—T has at 
most k+1 components. Hence p,-1[A X8(U)]2=1. I shall show that 
at least for some set U as described, p,1[A X8(U) ] =0, thus getting 
a contradiction. 

There is a deformation retraction r: A XJ-R satisfying r(a, 0) 
=a, r(x, t)=x, x in R, and r(A, 1)=R. For any set U, described 
above, this induces a deformation retraction r’: [A X8(U)]XI--R 
by the equation r’[(a, 5), ¢]=[(r(a), t], a in A, bin B(U), 
t in J. Then r’ induces a homomorphism r’: H,[A XB(U)] 
—H,|RXB8(U)], these being homology groups, which is well known 
to be an isomorphism onto. Hence if dim RX 8(U)<n-—1, implying 
Pn1[R XB(U)] =0, the proof would be complete. However, this fact 
is not obvious, and an analogous statement is not true for product 
spaces in general. Suppose, then, that for every U in B satisfying our 
conditions, dim RX $8(U)=n—1. Then there exists an uncountable 
collection, G, of such neighborhoods, the boundaries of no two inter- 
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secting.* Each set RX6(U) contains a Cantorian (”—1)-dimensional 
manifold, by [5; VI 8], and which, by [1], therefore separates some 
open n-cell in A XB. It is not hard to see that some open n-cell D in 
AXB is separated by uncountably many sets [RX8(U)|OD. By 
Whyburn’s [12; III, 2.2], there is an uncountable collection H of 
such sets forming a saturated collection; that is, given any point x in 
D, and a set X in H not containing x, some other set in H separates x 
from X in D. However, for each set [RX8(U)]OD in H, there is a 
point a in R which is arcwise accessible from A—R by an arc T, and 
which is such that, for some b in B(U), TXb is in [A X8(U)]|OD. It 
follows that each point of TXb—axXb is separated from [RX8(U)]| 
{\D in D by some element of H. But each two sets A X£6(U), 
AxXB(U’), U and U’ in G, are mutually exclusive, which gives a 
contradiction. 

Theorem 3 says, roughly speaking, that a set which is too much like 
a manifold with boundary cannot be a factor of a manifold without 
boundary, thus generalizing the statement following Theorem 2. 
Some light may be thrown on the proof by the following example. 
Pontrjagin [8] constructs two compact 2-dimensional spaces, A and 
B, such that dim A XB =3. Add a countable sequence of arcs, { T;}, 
to A, each having only an end point in common with A, with UT; 
dense in A, and with lim diam 7;=0. Let A’=A\W/UT;. Then A isa 
deformation retract of A’, and is nowhere dense in A’. But dim 
A’XB=dim A XB. 


4. Fiber bundles. Primarily to fix the notation, I shall state the 
definition of fiber bundle. A fiber bundle is the totality consisting of 
(a) three spaces A, B, and F, called the bundle, the base space, and the 
fiber, respectively; (b) a topological group G of homeomorphisms of F 
onto itself, called the structure group; (c) a continuous map, : AB, 
called the projection, with the property that for each } in B, p~'(d) 
is homeomorphic to F; (d) a covering { Ua} of B by open sets called 
admissible neighborhoods, and a family of homeomorphisms ¢.4: U. 
X F-p-'(U.) such that f) for each (5, f) in U.X F. These 
entities must also satisfy the following condition: For } fixed and each 
a such that U, contains b, consider the map ¢.3: F-—>p~'(b) defined 
by das(f) =¢a(b, f). If b is in let gas=Om' bas. Then gag is a 
homeomorphism of F onto F, and we require this to be in G. It was 
pointed out in [3] that for G the group of all homeomorphisms of F 
onto F, the definition can be simplified to (a), (c), and (d). The above 
notation will be used without explicit mention. 


* From this point on, the argument is a generalization of that given in my paper 
[14]. 


| 
| 


1950] ON THE FACTORS AND FIBERINGS OF MANIFOLDS 219 


5. Conditions for components of a fiber to be homeomorphic. I 
prove several simple results, that have apparently escaped notice, 
concerning fibers. 


THEOREM 4. If the fiber bundle A is a connected topological space, and 
the fiber F is the union of two separated sets, F, and F2, then some 
element of G maps a point of F, into Fs. 


ProoF. Suppose the theorem is false. Let D;, i=1, 2, be the set of 
all points a in A such that for some U, containing p(a) =b, 63'(a) is 
in F;. This definition is independent of the particular choice of Uz. 
For, if Us contains b, since ¢.(Fi) contains a, and since ¢%'¢a( Fi) is 
in F; (if for some g in G, g(F2) meets Fi, then g~'(F,) meets F2), we 
have is in F;. Clearly, Now U. X Fi, i=1, 2, is open 
in U.XF, which is homeomorphic to U.), so \p~( Uz) is open 
in A. But D;=U.Di\p-'(U.), so that D; is open in A. This implies 
that D, and D, form a separation of A, whereas A is connected. 


THEOREM 5. If the fiber bundle A is a connected topological space, and 
each component of F is open in F, then each two components of F are 
homeomorphic, and the structure group G acts transitively on the com- 
ponents of F. 


ProorF. Let F; be the union of a component of F and all components 
of F homeomorphic to it, and let F.=F—F,. Then F, and F, are 
mutually separated. By Theorem 4, some element g of G maps a 
point of F, into F,. But then some component of F; is mapped homeo- 
morphically onto a component of F; by g. This is impossible, so that 
F,=0. 

If the structure group G does not act transitively on the com- 
ponents of F, then F is the union of two separated sets F, and F, 
where no element of G maps a component of F; onto a component of 
F;. This again contradicts Theorem 4. 


CorRoLuary 5.1. If the fiber F has only a finite number of components, 
or if A is locally connected and connected, then the hypothesis on F in 
Theorem 5 is satisfied, and any two components of F are homeomorphic. 


The next result is suggested by the Eilenberg-Whyburn factoriza- 
tion theorem [12; VIII, 4.1]. For economy in space (!), I shall not 
state the most general hypothesis. 


THEOREM 6. Let the fiber bundle A be locally compact, metric, and 
connected. Let the fiber F have only a finite number of components. Let 
pb =Im be the monotone-light factorization of p, with m(A) =C. Then the 
map m defines a fibering of A with base space C, fiber F’ a component of 
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F, projection m and with structure group the group of all homeomorphisms 
of F’. The map | defines a fibering of C with base space B, fiber a set 
I(x), x in B, projection 1, and with structure group isomorphic to the 
structure group G of p, modulo the subgroup of G consisting of all those 


elements which leave each component of F fixed; C is a covering space of 
B. 


Notice that if we had that A were compact and locally connected, 
it would follow that F had only a finite number of components. 

Proor. By 5.1, all components of F are homeomorphic to one, F’. 
The space C is the space of all sets which are components of a set 
p(x), x in B, and whose open sets are defined as the set of all such 
components lying in an open set in A. Let U be an admissible neigh- 
borhood in B. Then p~!(U) is the union of a finite number of open sets, 
Vi, - ++, Vn, each homeomorphic to F’X U, and each kteing unions 
of components of sets p~!(x). Considering these components as points 
of C, the map /: CB is 1-1 and continuous on each set V/, where 
Vi is the open set in C whose elements are subsets of V;. It is easy 
to see that / is a homeomorphism on each V/, so that C is a covering 
space of B, and / is a fibering of C. The statement concerning the 
structure group for / follows from Theorem 5. 

Using the information about the sets V; obtained from 9, it is easy to 


see that m: A->C is a fibering as described, using the remark at the 
end of §4. 


6. Compact fiberings of E". We are now ready to apply the 
above results to the question as to whether E", Euclidean m-space, 


can be a fiber bundle with a compact fiber, raised by Montgomery 
and Samelson in [7]. 


THEOREM 7. There is no compact fibering of E* by a fiber, one com- 


ponent of which is a k-sphere, or a k-torus, or a set of the type described 
in Theorem 3. 


Proor. This is a consequence of Theorems 5, 6 and the fact that 
(a) as Montgomery and Samelson point out, Gysin’s methods [4] 
show that E" cannot be fibered with a sphere as fiber; (b) Eckmann, 
Samelson, and Whitehead [3] have proved that there is no such 
fibering with a k-torus as fiber; and (c) Theorem 3. 


THEOREM 8. There is no fibering of E", n>1, by a compact fiber of 
dimension 1 or n—1. 


Proor. We may assume, from Theorem 6, that F is connected. For 
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dim F=1, the theorem is a consequence of Borsuk’s result referred 
to above (and generalized in Theorem 2), and Theorem 7. 

If dim F=n—1, then dim B=1. For if dim B>1, B contains un- 
countably many mutually exclusive compact sets {C.} with dim 
C.=1, by [5; VII, 5]. Then by Hurewicz’s theorem, dim C,X F=n, 
so that each set C.F has interior points. But this contradicts the 
separability of E*. Hence B is either a simple closed curve or an 
open curve, since by Borsuk’s result, it is locally Euclidean. Now if 
x is in B, then p~'(x) separates some open subset of E* containing it, 
and hence separates E* into two sets each of which is an inverse set 
under p~'. It follows that B is not a simple closed curve. On the 
other hand, since p~'(x) is compact, one of its two complementary 
domains has a compact closure. Hence so does one component of 
B-—x. But then B cannot be an open curve. 


COROLLARY 8.1. There is no fibering of E* or E* by a compact fiber of 
dimension greater than 0. 


In [15] I gave a general theorem on interior mappings of E? from 
which the part of 8.1 concerning E? followed. As shown there, for E? 
and the case where the dimension of the fiber is 0, results of Whyburn 
prove that this is also impossible. For E*, »>2, the case of a compact 
0-dimensional fiber is quite difficult and includes the following: If 
f: E*-E* is periodic with period k, then not every orbit of a point 
under f has exactly & points. (If for some such mapping, every 
orbit had & points, it is easy to see that E* is a covering space of the 
orbit space, so that it is a fiber bundle with an orbit as fiber.) 


COROLLARY 8.2. There is no fibering of S*, n>1, by a fiber of dimen- 
ston n—1. 


Proor. Except that B cannot conceivably be an open curve, the 
proof is just like Theorem 8. 


THEOREM 9. There is no compact fibering of E*, n>2, by a fiber F of 
dimension n—2 onto a base space B of dimension 2. 


Proor. Suppose again that F is connected. Since dim B=2, it 
follows from Borsuk’s theorem that B is a 2-manifold. I now show that 
B is a plane, by showing that it is simply connected, the plane being 
the only non-compact simply connected 2-manifold without bound- 
ary. Let f: I-B be a closed path, J being the unit interval. Without 
loss, we can assume that f is constant over [1/2, 1]. By covering 
homotopy [9], and arcwise connectivity of F, there is a closed path 
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¢@: I->E* such that po=f. Since E* is simply connected, ¢ is homo- 
topic to a constant mapping, so that so is f by the homotopy induced 
in B by p. 

Let U be an admissible 2-cell neighborhood in B, and let J be a 
simple closed curve in U. Then J separates B into two connected 
sets, so that p~-'(J) separates E* into two connected sets. By the 
Alexander duality theorem, },1[p-'(J)]=1, 5, denoting the kth 
Betti number. Since U is admissible, p~'(J) is homeomorphic to 
JXF, and by the Kiinneth formula we must have },_.(F) =1. It 
follows that for any point x in B, b,_2[p—'(x)]=1, so that in E* 
—p~ (x) there is a (singular) 1-cycle, Z', linked with p—'(x). Add a 
point w,, w2 to each of E*, B to make them both spheres. Consider 
the singular 1-cycle p(Z') in B. This lies in BUw.—x, an open 2-cell, 
so is contractible to w in this set by a homotopy h(y, #). Since covering 
homotopy holds for compact subsets of B, we can move Z' into an 
arbitrary neighborhood of w, by a homotopy covering h(y, #) for 
some range of ¢. This implies that Z' bounds in (E"Uw,) —p-'(x), a 
contradiction. 

This argument is quite closely related to Montgomery and Samel- 
son’s [7; Lemma 2]. I might note that if b,2(F) +0, it would follow 
that every sufficiently small simple closed curve in B would separate 
it, so that B would be a 2-manifold, and the condition on dim B 
could be removed. 


THEOREM 10. There is no fibering of E* by a fiber F of dimension 2. 


Proor. Assuming F connected, it follows from Theorem 2 that it is 
a 2-manifold. Hence dim B=n—2, and we can assume dim B>1, or 
the result follows from Theorem 8. Then B has no local cut points, so 
that there exists an open set U, with U in an admissible neighbor- 
hood, such that 6(U) separates B into two connected open sets. 
Then b,_1[FX8(U)]=1, so that b.(F)=1. Montgomery and Samel- 
son point out that the fundamental group of F must be abelian, and 
the only two orientable 2-manifolds without boundary having this 
property are the sphere and the torus. Hence Theorem 10 follows 
from Theorem 7. 


CoROLLARY 10.1. There is no fibering of E*, E® by a compact fiber 
of dimension greater than 0, except possibly for if dim F=3, dim B>2. 


Proor. Each possible dimension of F is covered by one of Theorems 
7, 8, 9, and 10. 
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A PROBLEM ON INVERSE MAPPING SYSTEMS! 
LEON HENKIN? 


Let A be a set of indices a, 8, - - - directed by an ordering relation 
<; that is, for each a, 8 there is a y such that a<¥y and 8<y7. By an 
inverse-mapping-onto-system of type A is meant a family of non- 
empty disjoint sets M. together with functions for each 
mapping Ms; onto M,. It is required that these functions be such that 
if a<B<vy then for each My. The inverse limit of 
such a system consists of those functions f defined over A such that 
f(a) =x.€ M, and if then I’x,=x,. If every inverse-mapping- 
onto-system of type A has a nonempty inverse limit then the directed 
set A is called special. 


THEOREM. The necessary and sufficient condition for A to be special 
ts that it possess either a maximal element or a simple cofinal sequence. 


To show sufficiency in the case where A has a maximal element is 
trivial. If a;<a2:< - - - is a simple cofinal sequence in A, an element 
of the inverse limit may be constructed by choosing x., arbitrarily in 
M.,, and selecting xa;,, to satisfy since 
II is always onto. Finally, in case B is not one of the ai, it is easily 
shown that xg is uniquely defined by the formula x3=II#‘x.,, where 
any a; such that B<a; is employed. (There must be one since {a;} 
is cofinal.) This simple argument is contained in Tukey’s paper.' 

To show necessity we assume that A is special and construct the 
following inverse-mapping-onto-system of type A. 

A point is a finite sequence x= (aj, a2, , Consisting 
of an even number of indices from A which satisfy the following 
conditions: 

(i) a1 <az, 

(ii) Q2i-1 <Qei+e for 0<i<n, 

(iii) and for OS 7 <i<n, where holds 
when neither nor a=8. 

We define index x=Oan-1, order X=Q2,, length x =n. 

Let M, consist of all points of index a. Given a<§, we define 
the mapping II% by prescribing its image for an arbitrary point 


Received by the editors November 26, 1948. 

1 The problem treated and notation used in this paper are taken from J. W. Tukey, 
On denumerability in topology. The theorem proved is one of two alternative con- 
jectures made there about the problem. 

* Frank B. Jewett Fellow. 
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=(a4, 2, Gen) Of Mg (so that as follows. In case 
aSay, let T8x=(a, a2); since x is a point we have a, <az, whence 
a Sa; implies so that a) is a point. In case we can 
find (since a<8 =aen_1) a least such that a Sa2;4; (whence 
for k<j). In that case we set a2, ---, Q2j, &, O2;42), Which 
is easily seen to satisfy conditions i, ii, and iii if x does. 

These sets M. and mappings IIé form an inverse-mapping-onto- 
system of type A. It is a simple matter to check the transitivity of 
the mappings: for a<B<vy. To see that II8 is onto (where 
a<B), let x =(a1, a2, - - , be any point of M, (so that ae,1=a). 
Choose and consider the sequence y=(a1, a2, , Qen, B, Y). 
Since x is a point and a2,,=a<B<y, it is only necessary to verify 
that BXoej1, j=0, 2,--+-,n—1, in order to conclude that y is a 
point. But this is certainly the case since B Sa2;,; and a<B imply 
Q2on—1 =A <Q2;41 contrary to the fact that x is a point and so satisfies 
iii. It is now easily seen that y is in Mg and IIfy=x. 

Since we are assuming that A is special it follows that the above 
inverse-mapping-onto-system has a non-empty inverse limit. Let f 
be a function defined over A such that f(a) =x.G Ma, where a<8 
implies II8x,=x.. The set of orders of points x. is cofinal in A, since 
for any a we have a=index x.<order x, (by i and iii). Hence we 
can prove our theorem by showing that the orders of points x. 
either possess a maximal element or form a simple sequence. 

To see this last fact it is only necessary to observe that if length 
Xa=length xs then order x,=order xs. For choose y so that a<y 
and Hence we must have =x, and IIjx, =x,. But from the 
definition of the mappings II it then follows that the orders of x. 
and xg are certain elements a2; and a; in the sequence x,. Since 
i=length x.=length x3=j we have order x.=a2;=a2;=order Xz. 

Thus if the lengths of the points x. are unbounded there will be 
a simple sequence of orders B (one for each length), which is cofinal 
in A. In the contrary case the orders form a finite cofinal subset of A 
and so one of them must be maximal. 

This completes the proof of our theorem. 


Coro.iary I. If A is special and B is a cofinal subset of A then B ts 
special. 

Coro.iary II. Without the axiom of choice we can construct, for 
each directed set A, an inverse-mapping-onto-system of iype A which 
has a nonempty inverse limit if and only if A is special. 
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ON UPPER SEMICONTINUOUS FUNCTIONS 


E. J. MICKLE AND T. RADO 


1. Introduction. In his important paper [1] (see Bibliography) on 
the theory of the Lebesgue area of surfaces, L. Cesari established the 
following interesting result. Let X be the unit cube 05x;<1, 


j=1, ---,m, of Euclidean m-space and let Y be the unit cube 0 Sy, 
«=1,---,m, of Euclidean m-space. Let F(x1,---, Xm, 
bea real-valued, bounded, upper semicontinuous func- 
tion in the product space X X Y. Then for each point (x, - - - , Xm) 
€X, Fis a bounded, upper semicontinuous function of (y1, - - - , Yn) 
€ Y, and hence F attains, for each fixed (x1, - - - , Xm) CX, a maxi- 


mum value M(x, + - - , xm), while (y1, - - + , yn.) varies in Y. Accord- 
ing to the theorem of Cesari, there exists then in X Borel measurable 
functions y;=©®,(x1, - - - , Xm), #=1, - - , m, such that identically 


Flax, (x, Zn), ®,,( x1, Xm) | 
M(x1,--*+, Xm)- 


The proof given by Cesari is essentially an induction proof. The pur- 
pose of this note is to give a direct proof for a more general result 
(see our Theorem 3.1) which contains the theorem of Cesari as a very 
special case. In this generalized result, X is any metric space and Y 
is any compact metric space. In this general setting, the concept of a 
real-valued Borel measurable function is replaced by the following 
concept (cf. Kuratowski [2, p. 177]). A single-valued transformation 
T from a metric space S; into a metric space S: is termed a Borel 
transformation if for every closed set F.C S: the inverse set T—1( Fs) 
is a Borel set in S; (and, hence, if the inverse of every Borel set in 
S: is a Borel set in Sj). 


2. Preliminaries. Throughout this section X will denote a metric 
space and J will denote the unit interval 0<#<1. Points in X will be 
denoted by x. 


LEMMA 2.1. In the product space X XI let E be a set which satisfies 
the following conditions. 


(a) For each to the set 
FZ, [(x, t) = E, t to| 


is a Borel set. 
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(b) For each xo the set 
E [(xo, t) = 


is a closed interval containing 0. Then E is a Borel set. 


ProoF. Let 71, 72, - - - be the set of rational numbers in J. For each 
pair of integers m and m we set 


En= E 1) € 1). 


It follows from condition (a) that each set E>, is a Borel set. Thus the 
sets 


En = >. m = 1,2,--- 


are Borel sets. The reader will verify readily that E= [],,En. There- 
fore E is a Borel set. 


LemMA 2.2. In the product space X XI let E be a set which satisfies 
the following conditions. 
(a) For each to the set 


-, [(x, E, t = 


ts a Borel set. 
(b) For each xo the set 


E [(xo, HE E] 


ts a closed interval containing 1. Then E is a Borel set. 
Proor. The proof is similar to that used in Lemma 2.1. 


LEMMA 2.3. Let t=f(x) be a single-valued transformation from X into 
I and in the product space X XI let 


A necessary and sufficient condition that t=f(x) be a Borel transforma- 


tion is that Ex be a Borel set. The same statement holds with Ex replaced 
by E*. 


Proor. Assume that ¢=f(x) is a Borel transformation. Then Ey 
and £* clearly satisfy the conditions of Lemma 2.1 and Lemma 2.2 
respectively. Hence Es and E* are Borel sets. 
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Assume that Ex is a Borel set. We assert that f-'(B) is a Borel set 
whenever B is a Borel set in J. By well known results it is sufficient 
to prove this assertion for the case where 


(1) 1]. 


Let B be a Borel set as given in (1). Set 
X°= By = 


Let P: (x, t)—>x be the projection of X XJ onto X. From the defini- 
tion of Ex and by well known properties of Borel sets we have the 
following facts concerning the sets B and Bx. (i) f-'(B) = P(Bx). (ii) 
Since Ex is assumed to be a Borel set, Bx is a Borel subset of X°. 
(iii) Since P is a bicontinuous transformation from X° onto X and 
Bx is a Borel subset of X°, P(Bs) is a Borel set. From (i) and (iii) it 
follows that f-'(B) is a Borel set. 

A similar reasoning applies if E* is assumed to be a Borel set. 

Let g(x, ¢) be a real-valued, bounded, upper semicontinuous func- 
tion defined on the product space X XC where C is a closed subset of 
I and C contains 0. Set 


M(x) = max g(x, #) for x fixed, # EC. 


Since g(x, #) is an upper semicontinuous function and C is compact, 
the set 


E(x) = E [g(x, t) = M(x), x fixed] 


is a closed subset of C for each x. Set 

(2) t = f(x) = for 7 € E(x). 
LEMMA 2.4. Under the above conditions the transformation t=f(x) 

defined in (2) is a Borel transformation from X into C and for eachxCGX 

(3) g[x, f(x)] = M(z). 


Proor. The relation (3) follows from the definition of t=/(x) in (2). 
To prove that t=f(x) is a Borel transformation we set 


gi(x,?) = maxg(x,r) EC. 


Since 0 is in C, gi(x, #) is defined on X XI. It is easily shown that M(x) 
and g:(x, #) are bounded, upper semicontinuous functions on X XJ 
and g:(x, S$ M(x). Thus the function 
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is a Borel measurable function. Hence the set 


Eo= E, 4) = 0] 


is a Borel set. It is easily shown that the set Ep» is equal to the set 


E*= E (f(x) sts 1). 


(z,t) 


Therefore, by the result in Lemma 2.3, t=f(x) is a Borel transforma- 
tion. 


The reader can easily verify the following result on Borel trans- 
formations. 


LEMMA 2.5. Let Y be a metric space which is the product of n metric 
spaces Y;,---, Y, and let 


T: = i=1,---,n, Vi, 


be a single-valued transformation from X into Y. A necessary and suffi- 
cient condition that T be a Borel transformation from X into Y is that 


each of the transformations y;=®;,(x) be a Borel transformation from 
X into 


3. The main results. 


THEOREM 3.1. Let X be a metric space, let Y be a compact metric 
space, and let F(x, y) be a real-valued, bounded, upper semicontinuous 
function defined on the product space X X Y. Then there exists a Borel 
transformation 


(4) y = 
from X into Y such that for each xEX 
(5) Flx, (x)] = M(x), where M(x) = max F(x, y) for x fixed, yCY. 


Proor. Since Y is compact, Y is the continuous image of a closed 
subset C of the unit interval J:0 #1, where C contains 0 (see Why- 
burn [3, p. 34]). Let us denote this continuous transformation by 


y=¢(t), 
Then 


(6) g(x, t) = F[x, 


is a real-valued, bounded, upper semicontinuous function defined on 
the product space X XC and (see (5)) 


M(x) = max g(x, ?#) for x fixed, ? EC. 


= 
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The transformation ¢=f(x) defined in Lemma 2.4 in terms of g(x?) 
is a Borel transformation from X into C for which 


(7) glx, f(x)] = M(x). 
Set 
(8) y = = ¢[f(x)]. 


From (6), (7), and (8) 
F[x, &(x)] = F[x, 6(f(«))] = glx, f(x)] = M(2). 


Thus the transformation defined in (8) satisfies the relation (5). Let 
B bea Borel set in Y. Then 


= f[6-(B)]. 


Since y=¢(#) is a continuous transformation from C onto Y, @~'(B) 
is a Borel subset of C, and since t=f(x) is a Borel transformation from 
X into C, f-![¢-(B) | is a Borel set. Thus the transformation defined 
in (8) is a Borel transformation from X into Y which satisfies the 
conditions of the theorem. 

If in the preceding theorem Y is the product of m metric spaces 
Y;, -- +, Y,, the Borel transformation (4) can be written in the form 


(9) y= i= 1,--- 
By Lemma 2.5 we thus have the result: 


Coro.uary. Each transformation y;=®;(x),i=1,---, n, given in 
(9) ts a Borel transformation from X into Yj. 


If X coincides with the unit cube 0Sx;51, j=1,---,m, of 
Euclidean m-space and Y coincides with the unit cube 0<y;<1, 
#=1,---,m, of Euclidean n-space, then we obtain as a special case 
the theorem of Cesari stated in the introduction. 
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A NOTE ON PEANO SPACES 
JEAN M. BOYER AND D. W. HALL 


In three-dimensional space set up a cylindrical coordinate system 
(r, 6, z). The Hahn-Mazurkiewicz theorem characterizes Peano 
spaces (locally connected metric (compact) continua) as the continu- 
ous images of the closed unit interval J on the z-axis. In this note we 
obtain an extension theorem for Peano spaces (henceforth called P- 
spaces) based upon this characterization. 

We first define a dendrite L. To this end order the rationals in the 
interior of J into a sequence { (0, 0, r:}. For each pair of positive 
integers 7, 7, let L;,; denote the closed line segment joining (0, 0, 74) 
and (1/i+j, 1/i+j, ri). Let L_1,;, Lo,; denote line segments from 
(0, 0, 0) and (0, 0, 1) parallel to and the same length as Ly,; for every j. 
Then the dendrite L is defined to be the union of J and all the seg- 
ments L; ;. Let a;,; be the end point of L;,; which is not on J. We shall 
refer to a;,; as the free end of L;,;. 

Denote by D the sequence { (0, 0, d;)} consisting of the dyadic 
rational points interior to J enumerated in the usual way: d;=1/2, 
dz=1/2?, d;=3/2?, - - -. The following lemma is then easily estab- 
lished. 


Lemma. Let Q* be any finite or countable subset of I. Then there exists 
a homeomorphism h(I)=I such that k(0)=0, h(i)=1, 4(Q*—0-—1) 
CD. 


The result of this note may now be stated as follows. 


THEOREM. Let N be a P-space properly contained in the P-space P, 
and let g(I)=N be any continuous transformation. Then there extsts a 
homeomorphism h(I)=I and a continuous transformation f(L)=P 
with the following properties: 

(i) on I, f=gh 

(ii) 

In addition there exists a subset L* of L, consisting of the union of I 
and a certain subcollection Lf, of the line segments L;,; such that 

(iii) =P—-N, 
and 

(iv) if af, is the free end of Lit, for each i, j while A* is the union of 
the points then —N) =L* —I-—A*. 
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Proor. The proof will be given only for the nontrivial case in which 
N is nondegenerate. Let T(J) =P be any continuous transformation. 
Then the open set G= 7-!(P—N) is the union of a collection of open 
intervals {w;} having the respective left and right end points {4q;} 
and {t;}. It is convenient to assume that this sequence of intervals 
has been enumerated in order of decreasing diameters. 

For each integer ¢ for which a point gq; has been defined we choose 
a point g* in g-!T(q;). It does not follow that g;#q; implies g* ¥q}. 
However, there are at most a countable number of g; which have the 
same g*. We denote these by qi,1, gi,2, order of decreasing 
diameters of the corresponding w;. Thus g*=qj, for all 7. Denote by 
Q* the set of all points q* thus defined, and let k(J) =J be the homeo- 
morphism given by the lemma, and h(J)=J the homeomorphism 
given by h=&k-". For each pair of integers 7, 7 for which a point q;,; 
has been defined, let w;,; be the component of G which has q;,; as its 
left end point. Set d* = k(q*) for each 7. Then, for each pair of integers 
i, j for which a point q;,; has been defined, denote by L*, the line 
segment of L which has the point d* as its foot, and the point at; 
as its free end. 

The desired transformation f(L) =P is now defined as follows: 

(a) f(D) =gh(1D) =N. Thus (i) is satisfied. 

(b) f(Zi,;) =f(ds) for every Li,; which is not an Lf,. 

(c) f(Zg)) =T(ai,;) in such a way that f(d*) agrees with the 
definition in (a), f(at))=T(ri,;), and f(Lt,—d* —af,) = T(w;,;). Here, 
of course, ¢;,; denotes the right end point of w;,;. 

It is easily seen that the transformation f(L)=P satisfies all of 
the required conditions. 

In the important special case where N is a simple arc we may 


choose g(J)=N as a homeomorphism and obtain the following 
corollary. 


Coro.uary. If N is a simple arc in P, then f(L) =P may be so de- 
fined that f(I) =N is topological. 


This theorem may be restated as a decomposition theorem for 
the P-space P, where each of the sets f(L{,) is regarded as a P-space. 
Recent results of O. G. Harrold! should not be overlooked in the light 
of this theorem. 


THE UNIVERSITY OF MARYLAND 


1 See O. G. Harrold, Duke Math. J. vol. 6 (1940) pp. 750-752. Also Bull. Amer. 
Math. Soc. vol. 48 (1942) pp. 561-566. 
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NONLINEAR NETWORKS. IV 
R. J. DUFFIN 


It may be well to point out explicitly that the networks considered 
in this paper and in the preceding papers are definitely not arbitrary 
nonlinear networks. Rather, it has been the aim to define nonlinear 
networks which share the uniqueness properties of passive linear net- 
works. Physical considerations suggest that such networks may be 
aptly termed reliable networks. 

The networks considered here consist of transformers with ferro- 
magnetic cores (assumed to have negligible hysteresis) and Ohmic 
resistors, arbitrarily interconnected to a set of generators. It is found 
that as in the case of linear networks, under given electromotive 
forces from the generators, the currents flowing are uniquely de- 
termined after sufficient time has elapsed. Moreover, if the electro- 
motive force is periodic, a unique, periodic current flow can exist. 

The proof begins by integrating the network equations with re- 
spect to time. It turns out that the integrated equations have the 
same form as the network equations of a network containing linear 
capacitors and nonlinear resistors. The differential permeability of a 
ferromagnetic lies between positive limits; in the integrated equations 
this is imaged by the condition that the differential resistance lies 
between positive limits. The results stated are then easily deduced 
from theorems developed in preceding papers for networks containing 
such quasi-linear resistors [1, 2].! (These papers will hereinafter be 
referred to as I and III.) 

In most applications of transformers the nonlinearity of the mag- 
netization curve is an undesirable feature; however, the operation of 
flux gate magnetometers, magnetic voltage regulators, frequency 
multipliers, and magnetic amplifiers depends essentially on the non- 
linearity. 

Considerable space is devoted to the concepts of a transformer and 
of a transformer network. In §1 the transformers are restricted to be 
of asymmetrical type so that their properties can be determined fairly 
rigorously from elementary considerations. The general transformer 
is treated in §4 by means of the hypothesis of the magnetic network. 
The concept of topologically coupled networks employed there will 
be found quite interesting from an abstract point of view. 


Presented to the Society, December 30, 1948; received by the editors November 
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1. Transformer networks. By a “transformer” is meant, in this 
section, a set of coils of insulated wire wound in uniform layers on a 
ring-shaped core of ferromagnetic material. A transformer network 
is a collection of transformers and generators, the various coils and 
generators being arbitrarily interconnected. Any conservative dis- 
tribution of electric current in the network may be uniquely ex- 
pressed as a superposition of cyclic currents 4, 72, - - - , 7, flowing in 
closed circuits, provided these circuits are selected so as to be a 
maximal linearly independent set. A positive direction of circulation 
is arbitrarily assigned to each circuit. Let e, be the sum of the electro- 
motive forces of the generators acting in the rth circuit. By Fara- 
day’s law of induction, e,=dv,/dt where v, is the net magnetic flux 
linked with the rth circuit. This assumes, of course, that these circuits 
have no resistance. To express equations which relate electromotive 
forces and currents, it is now necessary to express the magnetic 
fluxes as a function of the currents. 

The core of each transformer will be regarded as composed of a 
number of co-axial sub-rings with “small” cross-sectional diameter. 
By the assumed geometric symmetry it follows that the tangential 
magnetic field H is the same at every point of the sub-ring. The sign 
of H is determined by arbitrarily prescribing a direction of circula- 
tion in the sub-ring. According to Ampere’s law the line integral of 
the tangential component of the magnetic field around a closed path is 
given by the net current linked with the path. (This assumes the so- 
called rational units.) If the circumference of the ring is s, then 
sH= >> Cxix. Here C; is the net number of times the electric circuit 
k links the sub-ring, the sense of linkage being determined by the 
right-hand rule. 

In ferromagnetics the flux density, B, satisfies the relation B=p(A), 
and if dB =y’dH, then it is an experimental fact that the differential 
permeability, uw’, satisfies the relation u,Su’SA. Here yp, is the 
permeability of the vacuum. The positive constant A may be taken so 
large that this relation is uniform for all sub-rings, and in addition, 
that A-'<yp,. Let f be the magnetic flux flowing in the sub-ring 
under consideration. Then f=ap( > >Cyixs~"). Here a is the cross- 
sectional area of the ring. For convenience in notation this last rela- 
tion may be written f = U( >>Cyix) where the function U has a deriva- 
tive u lying between positive limits. 

Let the sub-rings from all the transformers be placed in an ordered 
sequence. The variables of the sub-rings will be distinguished by 
superscripts. Thus the net flux linked with the rth electrical circuit 
is v= >_C%fi. Introducing the relation for fi found above gives the 
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desired relation 


Differentiating with respect to time yields 


(2) => = 
Here i; =di,/dt, and the matrix L4 is defined by first summing over 
j; it may be called the inductance matrix. It is clear that Li is sym- 


metric; moreover, it is semi-definite, as may be seen from the follow- 
ing relation: 


Relation (2) would of course remain valid if the core material were 
replaced by a material having a constant permeability of unity. Let 


the inductance matrix in this case be designated as L,,; then it is 
clear from (3) that the following inequality holds: 


The following observation is of interest in the theory of linear net- 
works. Relation (3) shows that the inductance matrix L is semi- 
definite; this, of course, has long been known. The same formula 
shows that, conversely, any semt-definite matrix is the inductance 
matrix of some network. To see this, note that L is here expressed in the 
form L = K*K where K* is the transpose of the matrix K. On the other 
hand, it is a relatively easy problem of matrix theory to show that 
any semi-definite matrix permits of such a factoring. The observa- 
tion that an arbitrary matrix K can be achieved, in principle, by a 
suitable coil design completes the proof. (R. M. Foster has informed 
the writer that he also discovered this result and presented it some 
time ago in his classroom lectures. Foster’s proof is quite simple.) 


2. Quasi-linear replacements. The nonlinear relation (1) just 
obtained between the flux vector, v, and the current vector, i, is of 
the type which was called, in I and III, a quasi-linear replacement.? 


2 The following corrective paragraph should be added to the paper Nonlinear 
networks. 111 at the bottom of page 122: 

“The one-dimensional mean value — V(x) — V(y) = V’x— V’y remains valid 
in » dimensions if vi is interpreted as Vi =S,V' (ide. Here @ is a scalar and i=y 
+6(x—y). Clearly ve is a symmetric matrix. Moreover, integrating (iii) with respect 
to @ shows that V,, satisfies (iii). Thus V. satisfies the conditions (ii) and (iii) on V’, 
so in what follows the subscript m is deleted. While this notation causes some am- 
biguity, it is precisely the notation employed in Nonlinear networks. 1.” 
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Consider, then, an abstract transformation, »= V(z), of an n-dimen- 
sional vector space with corresponding differential transformation, 
dv= V'di, where V’ is the matrix of differential coefficients. If L is a 
symmetric semi-definite matrix, the transformation V(z) is a quasi- 
linear replacement of the linear transformation Lz if: 

(i) V(O)=0; 

(ii) V’(4) exists for all ¢ as a symmetric matrix; 

(iii) there is a positive constant, A, such that for all vectors 7 and 
z, A~'(Lz, 2) S$(V’z, 2) SA(Lz, 2). 

In the last line the standard notation for the inner product has 
been employed; that is, if x and y are two vectors, then (x, y) = oxen. 
The following simple consequence will be needed later: 


LemMA. If V(t) ts a quasi-linear replacement of Li, then so also is 
Vi(t) = V+k)— V(k) where k is a constant vector. 


To indicate the nature of a quasi-linear transformation, several of 
its properties will now be mentioned: If M is the manifold determined 
by vectors of the form Zi, and if k is any vector perpendicular to this 
manifold, then V(t+k) = V(z). All vectors of the form V(z) are con- 
tained in M. If 9 is in M, the relation v= V(z) has a unique solution 
for i in M, i= W(v). Moreover, the domain of W may be extended so 
that it also is a quasi-linear replacement of Li. Proofs of these state- 
ments are not difficult and are contained essentially in I and III. 


3. Transformer resistor networks. To obtain the equations defin- 
ing the flow of current in a general network containing linear trans- 
formers, linear resistors, and linear capacitors, it is simply necessary 
to add voltage drops due to the resistors and capacitors to the equa- 
tion already obtained for the inductive voltage drops. It is con- 
venient to introduce the vector y of electric charge, which is such that 
#=y’. Then the general network equations are 


(S) Ly" + Ry’ + Sy = «. 


Here R is the matrix of resistors, its diagonal elements, say Ri, 
being the total resistance in the kth circuit. The off-diagonal, R,,, 
has magnitude equal to the common resistance of the circuits k and 
r, and its sign depends on the relative directions assigned to the cir- 
cuits r and k. The formalism developed in §1 may be adapted to prove 
these and other properties of R. Interpret C? as the incidence matrix 
between the branches j and the circuits r of the graph. Thus, C?=1 
if the branch j occurs in the rth circuit and their senses agree, and 
C/ = —1 if their senses disagree. If the branch j does not occur in the 
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rth circuit, C/=0. In relation (1) interpret v, as the net voltage drop 
across the resistors of the rth circuit, and interpret U/ as the re- 
sistance of the jth branch. Then it may be seen from relation (3) that 
R is semi-definite. It also follows from (3) that if none of the branches 
have zero resistance, R is definite. For quasi-linear resistors, U/ is 
interpreted as the voltage current function, and the same proof shows 
that a quasi-linear replacement arises. 

It is not true that every symmetric semi-definite matrix is a re- 
sistor matrix of some network. To state in algebraic terms the restric- 
tions on R to be sure that such a matrix is actually a representation 
of a network is a topological problem of graphs which has not been 
solved. The matrix S relating to the voltage drops across the ca- 
pacitors has exactly the same restrictions and properties as that of R. 

It is to be noted that the assumption that one or more of the 
matrices L, R, and S could be singular is an idealization. Thus all 
actual conductors have a nonvanishing resistance, and if this were 
assumed, it would immediately follow that the matrix R is non- 
singular. It has been customary, however, and useful, to assume that 
some circuits contain, for example, only resistance or only capaci- 
tance, and so on. Where this idealization stops is partly a matter of 
taste, but the weakest assumption which appears to be useful is that 
the matrix L+R+S is nonsingular. 

The nonlinear networks considered previously were assumed to 
satisfy the equation 


(6) Ly” + V(y') + Sy =e 


where V(y’) was taken as a quasi-linear replacement of Ry’. The net- 
works to be discussed here are such that S=0. In the linear case the 
equation satisfied is 


(7) Li’ + Ri =e. 
In the nonlinear case the equation satisfied is 
(8) dV(i)/dt + Ri=e 


where V (i) is a quasi-linear replacement of Zi. In order to apply the 
theory developed in I and III, it is necessary to reduce equation (8) 
to that of the form of equation (6). Integrating equation (8) gives 


(9) V(y’) + Ry = p. 


Here p=/jedt+h where h is some constant vector of integration. 
Clearly, (9) is of the form (6) with L=0 and S=R. 
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THEOREM 1. Let R be an arbitrary symmetric definite matrix. Let 
e(t) be a finite integrable function for t=0. Then tf i, and 12 are two con- 
tinuous solutions of equation (8), 


f \|ix — < 
0 


Proor. The hypotheses given are sufficient to insure that relation 
(9) is valid. Moreover, it may be assumed that 4=0, because since R 
is nonsingular, there is a constant vector k such that Rk=h. Re- 
placing y by y+ achieves this result without changing the value 
of y’. Consider now the equation 


(10) Ly’ + Ry = 0. 


In order to apply the main theorem of I, it is necessary to know that 
all solutions of (10) are such that y’—0 as t>~. To show this, note 
that in a rotated coordinate system R is a diagonal matrix with 
positive diagonal elements. Now by making a simple transformation 
which merely shortens or lengthens the coordinate axes, R can be 
made to be the identity matrix, E. Another rotation of axes converts 
L to a diagonal matrix. Thus (10) breaks up into equations of the 
form +¥.=0, where 1,20. Clearly, then, as t>«. The 
transformations made in going to this new coordinate system were all 
nonsingular, so it follows that yg —0 in the old coordinate system 
also. Thus the theorem of I is now directly applicable and com- 
pletes the proof. 


THEOREM 2. Let R be an arbitrary symmetric definite matrix. Let 
e(t) be a continuous function of period 2x. Then equation (8) has one 
and only one solution for which 1 is a continuous periodic function. 


Proor. It is possible to write e=e,+s where the integral of e& 
over an interval of length: 27 vanishes and where s is a constant 
vector. Since R is nonsingular, there is a constant vector & such that 
Rk=s. Let i=i,+k. Then equation (8) can be written as dV;(i)/dt 
+Ri,=e, where V; is the quasi-linear replacement as given by the 
preceding lemma. Integrating this equation puts it in the form (9), 
where ~ is a periodic function with a bounded first derivative. 
Theorem 5 of III is now directly applicable, and it states that there 
is a continuous periodic solution 4. Theorem 1, above, implies that 
there is no other continuous solution of period 27 or, for that matter, 
of any other period. This completes the proof of Theorem 2. 


4. Interlinked electric and magnetic networks. The roman num- 
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eral III depicts a commonly used transformer core shape. Apparently 
the analysis given in §1 is not applicable for such unsymmetrical 
transformers. By making use of the familiar concept of a magnetic 
network, the conclusions of §3 will now be shown to remain valid, at 
the expense, however, of introducing an additional assumption. 

A magnetic network is formally analogous to a direct current 
electric network. First a maximal set of independent circuits is 
selected in the transformer cores. Then any distribution of flux in the 
cores can be expressed uniquely as a superposition of cyclic fluxes fi 
flowing in these circuits. This statement rests on the fact that 
div B=0. In the linear case the specific magnetic resistance is taken 
to be 1/y. In the nonlinear case the specific resistance function is the 
inverse of the permeability function. The magnetic network equations 
may then be written in analogy to the electric case. Expressed in 
vectorial form, they are: 


(11) m= M(f). 


Here m may be called the magnetomotive force vector. By definition, 
the component m/ is the line integral of the tangential component of 
the magnetic field around the jth circuit. 

Suppose now that this magnetic network is topologically inter- 
linked with an electric network. It is desired to relate the vectors m 
and f of “magnetic space” with the vectors v and ¢ of “electric space.” 
Introducing the linkage matrix, C, in exact analogy to the matrix C 
in §1, the generalization of relation (1) is: 


(12) v= Cf. 
On the other hand, Ampere’s line integral law demands that 
(13) m = 


Equations (11), (12), and (13) define an implicit relation between v 
and i. By this method of analysis VerPlanck and Fishman [3] are 
able to obtain explicit solutions of several magnetic amplifier net- 
works in terms of the basic parameters. 

If the magnetic network is constructed of ferromagnetic material, 
the magnetic resistors may be termed quasi-linear resistors, because 
the differential resistance will lie between positive limits. The con- 
siderations given in §3 show that M is a quasi-linear replacement. 
Since none of the branches have zero resistance, it follows that M 
has an inverse, U. Thus 


(14) v = CU(C*i). 
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The differential transformation is do=CU’C*di. But U is a quasi- 
linear replacement, so it is easy to see that the matrix CU’C* satisfies 
conditions (ii) and (iii). Thus (14) defines a quasi-linear replacement 
in electric space. The theorems in §3 are now valid, so it may be said 
that networks of unsymmetrical transformers and Ohmic resistors 
are reliable. 

The analysis in this section tacitly assumes that it is permissible to 
treat distributed parameters as mean lumped parameters. A justifica- 
tion of this assumption will be attempted in a later paper. This will 
involve rather lengthy considerations, because the solution of a non- 
linear Dirichlet problem is demanded. 
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“CHARACTERISTIC DIRECTIONS” IN THREE-DIMENSIONAL 
SUPERSONIC FLOWS 


N. COBURN 


1. Introduction. It is well known from H. Lewy’s work [1]! that a 
hyperbolic quasi-linear second order partial differential equation in 
two independent variables can be replaced by a canonical system of 
first order partial differential equations with the aid of the two 
families of characteristic curves. Recently, R. Courant and K. O. 
Friedrichs showed [2; 3] that a system of n-totally hyperbolic quasi- 
linear first order partial differential equations in two independent 
variables can be replaced by another system of such equations with 
the property that each of the latter equations involves differentiation 
in only one direction. The n-directions which possess this property 
are called “characteristic directions” by Courant and Friedrichs. 
The “characteristic directions” of this type are used by these authors 
in formulating a finite difference method for treating problems such 
as steady, non-isentropic, two-dimensional supersonic flows. 

It will be shown that no such “characteristic directions” exist for 
the following steady, three-dimensional supersonic flows: (1) non- 
isentropic, (2) isentropic, irrotational. In other words, any linear 
combinations of the governing first order equations for these flows 
must involve derivatives with respect to (at least) two directions. 
Further, two such directions are sufficient for the second type of 
flow. It is shown that any two directions lying in the planes which 
envelope the bicharacteristic cone are permissible. 


2. The flow equations—“characteristic directions.” We shall use 
tensor notation although the x* (A=1, 2, 3) coordinate system will be 
considered to be Euclidean orthogonal. The components of the 
velocity vector are denoted by »*(A=1, 2, 3); the metric tensor by 
g™; the permutation tensor density by &. Further, the pressure, p, 
will be considered to be a known function of the density, p, and the 
entropy, s. Hence, we shall introduce the quantities c*, b defined by 


(2.1) c= oP 


For steady motion, the equation of conservation of mass, the laws of 
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motions, and the condition that the entropy of a given gas particle 
be constant are 


(2.2) —— + = 
2 
(2.3) — — — = 
(2.4) = 0, 


The conditions for irrotational motion are 


= 0. 
Ox 


(2.5) 


If we multiply the equation (2.2) by one, (2.3), (2.5) by the un- 
determined vectors ?, w, respectively, and (2.4) by an undetermined 
scalar o, we obtain, upon adding the resulting equations, 


Ov, 
Ap }+y* 
(pg + bo + wae) 


Op Os 
p Ox p 


(2.6) 


For non-isentropic flows, w.=0, since (2.5) cannot be used; for 
isentropic, irrotational flows, b>=0, o=0, since (2.4) is identically 
satisfied. With this understanding, (2.6) represents the most general 
linear combination of (2.2), (2.3), (2.4), (2.5). Let 7 denote a tangent 
vector to a congruence of space curves. That is, 


(2.7) a =—-, 


where s is some parameter along any curve of the congruence. The 
condition that (2.6) involve differentiation with respect to s leads to 
the equations 


(2.8) pg + + wee = prir, 
2 

(2.9) = 
p 

(2.10) ov + — = mi" 


p 
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where /* is an additional undetermined vector and k, m are additional 
undetermined scalars. For non-isentropic flows (w.=0), we solve 


(2.9) for and substitute this result into (2.8). This leads us to the 
equation 


p kp 
(2.11) — + — = pri. 
c? c? 
Let 
G = 1, 2) 


denote two independent vector fields orthogonal to #. Forming the 
scalar product of (2.11) with 


tata, 
we obtain 
(2.12) = (PH) j, k = i, 2. 
Since 


i 


consists of two independent vector fields, equation (2.12) implies a 
contradiction. Hence, we conclude that “characteristic directions” do 
not exist for non-isentropic, three-dimensional flows. For isentropic, 
irrotational flows, b=o=0 but w, need not vanish. As in the previous 


case, we solve (2.9) for /* and substitute the result into (2.8). We ob- 
tain 


k 
(2.13) (c2g — + Pot + wee = pri, 


Forming the scalar product of (2.13) with 


ir, 
we find 
(2.14) c= j=1,2. 
The equation (2.14) states that 

He (j = 1, 2) 


— 
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belong to the characteristic normal cone. Since 

i 
may be taken to be any two distinct vectors in the plane perpendicu- 
lar to # at any point P, the normal characteristic cone must de- 
generate to a plane at any such point. But this implies a contradiction 
since the normal cone for isentropic, irrotational flows is a right 
circular cone with axis along the velocity vector. Thus, “character- 
istic directions” do not exist for isentropic, irrotational three-dimensional 
flows. 


3. “Characteristic pairs of directions.” In this section, we con- 
sider isentropic, irrotational three-dimensional flows. If two distinct 
congruences of curves with tangent vectors 7", j* exist such that linear 
combinations of the governing partial differential flow equations 
can be expressed as linear combinations of derivatives along these 
two congruences, then we say that a “characteristic pair of direc- 
tions” exists. Some simplicity is obtained by eliminating p between 
(2.2), (2.3), where b=0 since s=constant throughout the flow field. 
Thus, (2.2), (2.3) may be replaced by 


(3.1) a’ — = 0, = — 


Our problem reduces to the question of determining when three 
vector fields wa, pa, ga and two congruences of curves ia, j« exist such 
that 


(3.2) + weer = prit + 


If k* represents a vector field orthogonal to 7, j* and r* some un- 
known vector field, then the tensor a“+w.e® can always be ex- 
pressed in the form 


(3.3) a + = Pik + + 
It is easily verified that 
(3.4) (a + k,. 


Similar expressions involving 7, j* (instead of k*) are valid for p*, @ 
(when #, j* are orthogonal). The necessary and sufficient condition 
that (3.3) reduce to (3.2) is that r*=0. That is, 


(3.5) Wale™hk,) = — ah,. 


Since ek, is an antisymmetric tensor, the rank of this tensor is 


| 
( 
Ov 
= 
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equal to zero or two. If k, is not identically zero, the rank is two. 
Evidently, (3.5) will possess 1! solutions for w. if and only if the 
rank of the augmented matrix of (3.5) is two. Direct computation 
or use of the e*“ symbols shows that 


(3.6) k, = 0 


is the necessary and sufficient condition for this to be valid. But (3.6) 
implies that &, lies in the normal characteristic cone. Hence 2, 7* 
must lie in a plane tangent to the bicharacteristic cone. It is evident that 
if (3.6) is satisfied, then ©! vectors w. can be obtained satisfying 
(3.5). For any such vector w., equations similar to (3.4), for appro- 
priately chosen 7*, 7*, determine p*, g so that (3.2) is satisfied. 

The above results furnish some additional insight into the method 
used by C. L. Dolph and the author in their study of three-dimen- 
sional supersonic flows. In their study, it was found that a char- 
acteristic system can be constructed from the geometry of the char- 
acteristic surfaces. However, the equations of the system contain 
derivatives with respect to two directions. This is essentially dif- 
ferent from the two-dimensional case, where only one directional 
derivative enters into each equation of the characteristic system. But 
in view of the above results for three-dimensional flows, it is seen that 
no smaller number of derivatives than two can enter. 
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SOME THEOREMS ON MEROMORPHIC FUNCTIONS 
YOSHITOMO OKADA 


Let f(z) be an integral function of finite order p20, and let n(r) 
denote the zeros of f(z) in | s| Sr. Set 


M(r, f) = max | f(z)|. 
lel|Sr 
Pélya! proved 


log M(r, 
lim dat < 


n(r) 
if p is not an integer, and Shah? proved 


log M(r, 
tim inf _ 


ro 
if f(z) is a canonical product of genus p such as p=), and ¢(x) is 
any positive continuous nondecreasing function of a real variable x 
such that [dx/x(x) is convergent. 
In this paper, some similar theorems will be obtained for mero- 
morphic functions. 


Let 
(1) F(z) = f(2)/g(2) 


be a meromorphic function of finite order, where 


2) 


g(z) 


ll 
by 
S| 
=) 


+--+) 

n=1 2 b,, q b, 

these being canonical products of genera p, g and of orders pi, ps, 
respectively. Let s be the genus of (1), that is s=max (9, g), and de- 
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note the number of the totality of a1, a2, - - - and by, be, - - - in | s| Sr 
by N(r). Moreover set 


M(r, F)=M(r) = max | F(z) |. 


Then we have the following theorems: 


THEOREM I. If for any given meromorphic function (1) of finite 
order 


max (p1, p2) = = 0, 


then 


(2) lim inf log* M (t)dt 0, 


where $(x) is any positive continuous nondecreasing function of a 
real variable x such that [°dx/x(x) is convergent. 


THEOREM II. If for any given meromorphic function (1) of finite 
order max (1, p2) is not an integer, then 


1 
(3) lim inf 
r+o rN(r) 


f logt M(t)dt < 
0 


To prove these theorems we establish the lemma: 


Lemma. For any given meromorphic function (1) of genus s, let 


C1, C2, - - denote the totality of ai, a2, - -and by, be, in | =F. 
Then 

1 r > 
(4) —f logt M(t)dt Sh 

rJo Jen +7) 


when s2=1; and for a number k>1, 


1 kr 
(5) —f log+ M(t)dt S he log (1 : 
rJo n=1 | Ca | 
when s=0. 


Proor. Assume s = max (p, g) 21. Since s= and is the genus of 
f(z), for a certain polynomial P(z) of degree <s we get 


n=1 n 


3 Hereafter we assume that hi, he, - + - are all some suitable fixed positive con- 
stants independent of r. 
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whence 
logt M(r, f) S + [] E(=, s)| 
n=1 an 
yeti 
6 
(6) = t+ ‘2 
yeti 
sh 
“= | a, |*(| Gn | r) 
Similarly 
(7) log+ M(r, g) he 


n=1 | |*( | | + r) 
Now, by the theory of meromorphic functions, we get 

1 
(8) —f log+ M(t)dt < C(k)T(kr, F), 

where 7(r, F) is the characteristic function of F(z), and C(k) depends 
on a number k>1 only, and also we have 

T(kr, F) = T(kr, f/g) S T(kr, f) + Thr, g) + O(1) 

log+ M(kr, f) + log+ M(kr, g) + O(1). 


Hence from (6), (7), (8), and (9) follows (4). 
Next, assume s=0. Then by the same process we get (5). 
PROOF OF THEOREM I. By the lemma, when s=0 we get 


N(t) 
dt; 
+ 1) 


for, when s21, from (4) this is obviously seen, and when s=0, from 


(5) we have 
1 * 6krN(t) 
—f log+ M(t)dt S inf — dt 
r 0 0 i(t + kr) 


N(t) 
kis — dt 


On the other hand the integral function 


(9) 


IIA 


1° 
(10) f logt M(t)dt S f 
0 0 


co 


n=l 


| 
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is of order p= max (1, p2), since G4, C2, - - - are composed of aj, de, - - - 
and 0, be, - - - , whose exponents of convergence are pi, p2 respec- 
tively; and the genus of G(z) is s=max (p, g), since f(z) and g(z) are 
of genera p and g respectively. Therefore by Shah‘ 


1 
lim inf ————— f 
rae N(r)o(r) +1) 
for p=s20. Hence by (10) we get (2). 

Proor oF THEOREM II. The order and genus of integral function 
G(z) are p=max (1, p2) and s=max (, gq), respectively, as already 
seen, so that the exponent of convergence of |c|, |c2|, - - - is p, and 
p>0, s=[p], since p is not an integer. 

Now set ®(x) for 1<p, and ®(x) =hz log (1+/x) 
for 0<p<i. Then ®(x) is a positive and decreasing function for 
x>0, and we have 


< (0<n<p-—s) 
for all values of x>0 near x=0, and 
<x (0<7<s+i1-—p) 


for all values of x sufficiently great. Therefore by Pélya® 


lim inf > o(! ) < 
0 


N r) n=1 r 


Hence from the lemma 


lim inf f logt M(t)dt lim inf > ) < 
rN(r) 0 N r) n=1 Tr 


ToHoKu UNIVERSITY 


4 Shah, loc. cit. pp. 26-30. 
5 Pélya, loc. cit., Theorem VI, pp. 173-174. 


REPRESENTATION OF FOURIER INTEGRALS AS SUMS. II 
R. J. DUFFIN 


Let the functions f(x) and g(x) be defined in terms of the function 
(x) by the sums: 


(1) fa) 


1 n 


and 


Ge n 
(2) 
1 x x 
Here a, =sin (rn/2), n=1,2,---. 
In a previous paper [1]! it was shown that under quite weak re- 
strictions on ¢: 


(3) g(x) = xt f(t)dt 


(4) f(x) -f sn xt g(t)dt. 


Here sn x is an abbreviation for sin (xx/2). 

This paper gives other conditions for the validity of these identities. 
The previous conditions permitted ¢ to have various types of dis- 
continuity. The present paper is concerned with smooth functions; 
however, the growth at 0 and & is permitted to be greater than be- 
fore. Theorems 1, 2, and 3 of the previous paper together with 
Theorems 2 and 3 of this paper form a fairly complete elementary 
theory of these identities. The proofs given here do not depend on the 
previous paper. 

The results of this paper hinge on the possibility of defining the 
Fourier sine transform for functions which do not vanish at infinity. 
Theorem 1 below shows that this is possible merely by employing 
summability. It is to be noted that Theorem 1 is not true, as it stands, 
for the cosine transform. For example, the cosine transform of any 
constant evaluated by such a definition would vanish. Hence the 
inversion formula could not apply. A theory of “generalized Fourier 


Presented to the Society December 30, 1948; received by the editors November 
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integrals” has been developed by Fry, Hahn, Wiener, Burkill, 
Bochner, and others [2]. These writers do not differentiate between 
sine and cosine transforms, and it appears that their results contain 
no equivalent of Theorem 1. 


THEOREM 1. Let f(x) be a function whose derivative f'(x) is of bounded 
variation in the interval (0, ~), and let f'(x) vanish at ~. Then the 
integral (3) is summable C, to define a function g(x). The integral (4) 
is improperly integrable at 0 and gives f(x) for x>0. 


Proor. A function which satisfies the conditions of the Theorem 
and also vanishes at 0 will be said to satisfy Conditions B. It is quite 
easy to see that the theorem is true for f(x) a constant, so it is suffi- 
cient to prove the theorem under Conditions B. Two integrations by 
parts show that 


k k 
(5) f sin ¢ f(t)dt = — cos k f(k) + sin k f’(k) — f sin ¢ df’(t). 
0 0 


The integral on the right converges absolutely as k—> ©. Therefore the 
integral on the left is summable C, provided the first two terms on the 
right are summable. But 


sinh 
=f cos k f(k)dk = sin k f’(k)dk. 


Because f’ vanishes at © it follows that the Cesaro mean of the term 
cos k f(k) vanishes as h->~. For the same reason the Cesaro mean 
of the second term on the right in (5) vanishes. Thus for x>0 


4 
(6) f sn xt f(é)dt = — <5) sn xt df’(t). 


It is worth noting that a simpler type of summability will also estab- 
lish (6). It is sufficient to take the limit in (5) for k=av/2; v=1, 
3, 5, -- +. This causes the undesirable term cos k f(k) to drop out. 

Denote the function defined by (6) as g(x), and let b and c be posi- 
tive constants such that b212c. Then 


b 
(7) fi» = -=f f 


c 


The inner integral on the right may be written 


> sn xsn xt 1 sn x — wx/2 1 «sn xt 
f ast f sn xitdx + dx. 
1 x? 2x 


— | 
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It is obvious that the first two integrals converge uniformly in ¢ as 
b—« and c—0. It is easy to show that the third integral converges 
uniformly for ¢ in any finite range and that for all ¢ it is uniformly 
bounded. Thus relation (7) is valid for b= © and c=0 provided the 
integral on the left is evaluated as an improper integral at the lower 
limit. As is well known 


4 sn xsn xt 0474981, 
(8) =f = | 
0 x? #=> 1. 


Substituting (8) in (7) gives f(1)=J> sn x g(x)dx. The observation 
that f(kx) for k>0 satisfies the Conditions B and has the transform 
g(x/k)/k clearly completes the proof. 


THEOREM 2. Let o(x) be differentiable for x21. Let o'(x) be of 
bounded variation in the interval (1, ~), and let o'(x) vanish at ~. 
Let the same conditions be satisfied by ®(x) =@(1/x)/x. Then for x>0 
the series (1) and (2) are summable C, and satisfy (3) and (4). The 
integrals are summable C, at ~ and improperly integrable at 0. 


Proor. Consider first the special case when ¢(x) satisfies Condi- 
tions B. Let a function f’(x) be defined by the series 


(9) = > (=) 


The function ¢’ is bounded, so the series is uniformly convergent 
and may be integrated termwise. Hence 


BS 
-$24(5) 


Thus the series (1) converges and defines a function f(x) whose deriva- 
tive f’(x) is given by (9). From (10) it is seen that f(x) vanishes at 0. 


From (9) it is seen that f’(x) vanishes at . Let V denote the opera- 
tion of taking the variation on the interval (0, ©). Then from (9) 


x 
(11) va) (=) < 2V¢6"(2). 


1 
From these considerations it follows that f(x) defined by (1) satisfies 
Conditions B if @(x) does. 
Let f,,(x) denote the first m terms of the series (9), Then 
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f= t dfa(t) = t dq’ (-) 


0 


1 


(12) 


From (11) it is clear that V(f’—f,)--0 as m—~ ; hence the integral 


on the left in (12) approaches f> sn ¢ df’(t). The series on the right 
converges uniformly. Thus 


4 
(13) = f sn t df’(t) = f a(t) d¢’(2). 


Here o(t)= >-f (4a,/(xn)*) sn nt is a well known series: 
o(t) = ¢/2, oft) = 1 andso on. 
Thus 


f o(t) + 4'(5) =2 f do(?) 


= $(1) — [¢(3) — 4(1)] + [6(5) — 4(3)]. 


But ¢’(x) vanishes at ©, so in general 
(14) -f a(t) dd’ (t) = lim >> and(n) — ap(v)/2; v= 1,3,5,---. 
0 1 


It is easy to show that a series summable by the method indicated in 
(14) is also summable by Cesaro means when use is made of the fact 
that ¢(v)/vy—0. 

Combining (6), (13), and (14) gives 


fos t f(i)dt = ano(n). 


Replacing ¢(x) by $(x/k)/k in this relation proves (3). Theorem 1 
guarantees (4). 

Turning now to the general case, it is clearly possible to write 
(x) =di(x)+¢2(x) where gi(x) and ®2(x) =¢2(1/x)/x satisfy Condi- 
tions B. Thus 


+,(=) a= = £(2). 


n 1 x 


| 
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Likewise G2(x)=f2(x). Thus f(x)=fi(x)+G.(x) and g(x) =g,(x) 
+ F,(x), so the validity of (3) is guaranteed by the validity of (3) 
and (4) in the special case. The same is true of (4). 


THEOREM 3. Let (2) be analytic at interior points of the sector 
—Asarg 235A for some A>0, and in this sector suppose | p(z)| 
OSk<1. Then: 

(a) (x) satisfies the conditions of Theorem 2. 

(b) The functions f(z) and g(z) satisfy the same conditions as ¢(2) 
in the sector —5 Sarg 255 <A. 

(c) If k=0 and $(x) =0(1+| x|-) at 0 and ~, then f(x) and g(x) 
satisfy the same relation. 

(d) In case (c) the series and integrals converge without recourse to 
summability. 


Proor. Let z be a point in the sector 5; then by Cauchy’s integral 
formula for ¢’’(z) with a circular contour of radius proportional to 
|z|, it follows that |¢’’(z)| =O(|2| *-?+|2|--*). Thus /?|6’’(x)| dx 
<«, so $’(x) is of bounded variation in the interval (1, ©). Simi- 
larly, Cauchy’s formula for $’(z) shows that $’(x) vanishes at o. 
Clearly ®(z) satisfies the same conditions as ¢(z), so (a) is proved. 

Formula (14) may be modified so as to hold for functions which 
do not vanish at 0. It is convenient to let p(t) =1/2—o(t). Then it is 
easy to see that for z in the sector 6 


(15) g(z) = s(—)+5f (=)o t. 


Introducing the bound for $”’ gives 


Thus | g(z)| =O(|s|*+|2|-*-*). The uniform convergence of (15) 
guarantees that g(z) is analytic at interior points of the sector 6. 
Similar considerations apply to f(z), so (b) is proved. 

From (15) it is easy to see that if 7 is an integer of the form 4n+1 


n 1 j 1 t 
(16) = 6(2) +5 fo" (=) 


But if k=0, |¢’’(z)| =O(1+|2z|-), so 


| e(x)| < + 7 


1950] REPRESENTATION OF FOURIER INTEGRALS AS SUMS. II 


nn 


By choosing j sufficiently large, the last two terms may be made as 
small as desired for all x21. By the assumption in (c) it is clear that 
the series vanishes as x>~. Thus g(x) vanishes at ». The same 
inequality shows that x g(x) vanishes at 0. A similar argument applies 
to f(x), so part (c) is proved. 

It may be seen from (16) that since ¢(x)—>0, the series for g(x) 
converges in the ordinary sense. An integration by parts similar to 
that used in deriving (5) shows that because f(%) and g(k) vanish at 
©, the integrals converge in the ordinary sense at ©. Part (b) shows 
that the integrals (4) and (5) are absolutely convergent at 0. 
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INEQUALITIES CONCERNING ULTRASPHERICAL 
POLYNOMIALS AND BESSEL FUNCTIONS 


OTTO szAsz! 
1. Introduction. Ultraspherical polynomials are defined by the re- 
currence formula 
(1.1) ( + 1)Prys(x) = 2(m + — (m + 2d — 1)Pr-a(x), 
with P}=1, P}=2dx. Thus we have 


2P2 = A)x — 1], 
and so on. 
For \=1/2 we get the Legendre polynomials P,(x). They satisfy 
the following interesting inequality: 
An(x) = [P,(x) }* 
— 20, n21,-1 
with equality for x= +1. This result is due to P. Turdn, as stated 
in a recent paper by G. Szegé,? where four elegant proofs of the 


theorem are given. The first proof (similar to that of Turdn) is based 
on Mehler’s formula, while the third proof is based on the identity 


(1.2) 


IIA 
lA 


n=0 nN: 
and on the fact that the Bessel function J» has all its roots real. 
Szegé remarks that the formula 
n=0 P*(1) 
= + 1/2)e7*[(1 — — > — 1/2, 


and analogous formulas for Laguerre and Hermite polynomials yield 
inequalities analogous to (1.2) for ultraspherical, Laguerre, and 
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Hermite polynomials. Thus 


(1.3) | > 0, 
PX(1)J 
We have,’ for A> —1/2, 
2AT(n + 2d 
(1.4) P»(1) = = 
(2d + 1) 


so that (1.3) can be restated as 


n(n + 2d)[P.(x)]° = (m+ 1)(m + 2d — 
with A\=1/2, this reduces to the inequality (1.2). 
We write F,,(x) for P®(x) /P&(1); then (1.3) becomes AX(x) = [ }? 
— F,-1(x) Fn4:(x) 20, and equality holds for x =1. We have 
2(1 + — 1 
1+ 2A 


=1, F,(x) = x, F.(x) = 


For 0<A<1 we derive a sharper inequality, employing essentially 
the recursion formula (1.1). The result is 


— F.(2) © 
(1.5) 
(n + — 1)(n + 2d) A+1 2d) 
In particular, for \=1/2, 
1 — |P, 2 2 1 
[P.(x)] n+ 


3n(n + 1) 


the smallness of A, indicates that the inequality (1.2) is rather deep. 
For Bessel functions we shall establish the inequality 


(2n —1)(n+1)— 


2 1 
(1.6) J ,(t) (OF > —— 0, t real. 
ut+i1 


It seems particularly interesting that the procedure is very much 
the same in deriving (1.5) and (1.6), and is based mainly on the re- 
spective recurrence relations. 


2. Ultraspherical polynomials; \<1. From (1.1) and (1.4), we ob- 
tain 


3G. Szegé, Orthogonal polynomials, Amer. Math. Soc. Colloquium Publications, 
vol. 23, 1939, pp. 80, 166. Referred to as O. P. 
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+ 2A)(n + 1) 
+ 1+ 2a) 
I'(n + 2A — 1)n(n + 1) 
T(n+1+2) 


(m + 1)Faii(x) = 2(n + A)aF,(x) 


— (m + — 


or 
2(n + A)x n 
(2.1) +1(2) 2A (x) 1(x) n 
It follows that 
2(n + A) n 
(2.2) = xF F,-1(x)F (2). 
From (2.1) we get | 
2 A-1 —1 
n+2rX—1 
whence substitution into (2.2) yields 
2 
n+ 
2(n + A — 1) n—1 \ 
n 2(n+A— 1) n—1 
2nx(n + A — 1) 2 2(n +X — 1) 
(n + 2d — 1)(n + 2d) — 
—1 2 
n+2A\-—1 n+ 2r n-+ 
2n(n +r — 1)x 2 n+rA—1 
= | 2% ——— 
2(n — 1 


(n+ D— 1(n+ D) 


If we write 


[April 
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(m+ —1)(0 + = Da(2), 
then 


Da(x) = n(n +d — 1)Fe-a(z) — — + 
= {(n +2 — 1)(m + — 2)Fe-a(2) 
— (n+r— 2)(n + 2A — 1)F,-2(x)F n(x) } 
(n — 1)(n + dA) 
2A—1) 
(n+ 1)(n + 2A — 2)(m — 1)(n + A) 
— 2)(n + 2A — 1) 
(nw — + dA) 
(nm + Xr — 2)(n + 2A — 1) 
n(n +d — 2)(n + 2A — 1) — (wm — 1)(n + A)(m + 2A — 2) 
(n +d — 2)(n + 2A — 1) 
(nm — 1)(n + A) 
— 1)(n +d — 


+ F,-1(x) 


= +(n+rA- 1)F.-(x) 


Let 
(n + dX — 2)(m + 2A — 1) 
2.4 n = §n— = Zz, i= 1; 
(2.4) = +d n g 
then 


n+rA-—1 


n—1(x), 


where, from (?.3), 


D3(x) = + A)Fi(z) — A(1 + 2)FoF2(x) 
= \{2(1 + A)x? — 211 + + 1} 
= X, 


(2.6) 


From (2.4), we have 


| 
) 
| 
\ 
\ 
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MA+ @+ 2-1 
— + 2A — 1) he 
= 


(» — + d) 
(2.7) dA + 1) nl 
= 1+—) = O(n”), 


as @, 


It is well known [cf. O. P. pp. 80, 166] that, for A>0, 


A 
max | P,(x)| = Pa(1) = Casor—1.0» 


so that 
(2.8) | F.(x)| F,(1) = 1. | 
Hence 
(2.9) < = = O(n). 


We now consider the case 0<A<1; it follows from (2.9) that the 
series | 


2 


is uniformly convergent in —1<x 31, and from (2.5), (2.6) we have 


= (n+A)gn _2 
\ — lim g,D,(x) = 2X(1 — A F , (x). 
(2) n(n +2r+ 1) 


We write 
(n + A)gn 2 
lim g,D,(x 20-2 


= x(x), 


a continuous function of x in —1Sx31. It follows from (2.8) and 
(2.10) that 


(2.11) x(a) & x(t) = 041 2(1 n(n 


We next show that 


260 
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(2.12) x(1) = lim g,De(1) = 0. 


From (2.3), we have 


(2.13) D,(1) = (n+ d)(n + 2A — 1) — — 1)(n+2)=d, 


and from (2.7), for 0<A<1, we see that g,—0; this proves (2.12). 
It now follows from (2.11) that 


+ A)ga 
(4) 20=1-2(1-—A 
so that 


1 n(n+2A+1) 2(1 — A) 


Note that by (2.7) we have 


AMA + 1)P(m + 2d) AA + 1) 


gn = 


From (2.5) and (2.6) we get 


2 
(2.14) = — 2(1-2») > FAs). 


tA+1) 


furthermore 
| 
| g(v + 2 
= x(x — A F, (x). 
| = x(x) + 2A( (x 
Now from (2.8), (2.12), and (2.13) it follows that 
gv + 2 
= — {1 — Fas) 


> + A) g(v + d) 


M 


\ = gnD,(1) = 
It follows from (2.14) that, uniformly in 1, 


} (2.15) x(x), asn— 0; 


furthermore, 


| 
| 
| 
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(2.16) D,(x) = D,(1) 


ll 


From (2.15) we have 
beDa(2) < = 
From (2.3), (2.13), and (2.16) we obtain 


= (nm +d — 1)(m + 2d) {Falx) — } 
+ 2 2, 
so that 
A(1 — F(z) 
+A — 1)(m + 2d) 


An(x) = — 2 


and 


2 T(1 + 2a) 


n A,(x) x(x), 


uniformly in 
Summarizing, we have the following theorem: 


THEOREM 1. Denote by Px(x) the ultraspherical polynomial and let 


n(x) = n(x) = F(x) — 

PX(1) 1 1 
If 0<X<1, then n™AX(x) tends uniformly to a non-negative function, 
and 

2 

— Fi fx T(1 I'(n) 
( < (1 + 2A) nm) 
(n +A — 1)(n + 2d) A+ 1 + 


—1s 


IIA 


In particular for \=1/2 (the Legendre polynomial), we have 


3. The case A>1. Let now A>1; from (2.14) we have 


= d+ DO — 1) 


[April 
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so that g,D>(x) increases as m increases. Hence 
gnD a(x) 2, 
or 
(m+ — 1)(m + 2d)gwAn(z) + AgaFa(x) ZA = D,(1). 
Furthermore 
| + A) 
n(x) S = A + 2A(A — 1) = 
} Thus 
2 
M,, 
(n +d — + 2A)gn + — + 2A) 
and 
2 
— F, 
< ( (x)) 
(x +r — + 2A) 
In this case, from (2.7), we have g, J ©, and 
A,(x) = O(n ), as n— 


Finally, for \=1, we have 


sin (x + 1)@ 


x = cos 8, 


x 
= g.Di=1, g=1, =1, 
n(n + 2)A,(x) = 1 — F,(2). 
4. An application. If P,4:(x) =0, then from (2.17) we obtain 


1 — P,(x) < 2n+1 
(nm + 1)(2n — 1) 3n(n + 1) 


or 


1 2 2n + 1 
n(2n + 1) 3n(n + 1) 


If P,(¢)=0, then 


| 
| 
| 
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1 
(n(2n + 1))*/? 


which yields a lower bound for [x—i]. To this end we employ the 
formula [O. P. p. 83] 


< | P.(x)| = if (udu), 
t 


< Pr(x) = 
so that, for \=1/2, 
(x) = 
hence, from (1.5), for \=3/2, 
| (x) | S = (3/2)n(m + 1); 


equality holds for x= +1. 
It follows that 
1 


or 
| —t| > 2/3n-8/2(2m + 1)-/2(m + 1)-, 
where Py+i(x) =0, =0. 
5. Hermite polynomials. The situation is simpler for Hermite poly- 
nomials H,(x), defined by 
= 22H, (x) — 2nH,-1(x), m= 1,2,3,---, 
Hy, = 1, H, = 2x. 
In this case [see Amer. Math. Monthly vol. 55 (1948) solution of 
problem 4215, pp. 34-35], 
1 | 2 
—— {Hays — = — 


hence monotone increasing, as increases. 


6. Bessel functions. The Bessel function of order yu is defined for 
p> —1 by the power series 
= (—1)"(¢/2)” 


(6.1) = (¢/2)" 
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It satisfies the recursion formula 


2 
(6.2) Ina) = Ja, u>0. 
We write 
(6.3) A,(t) = + = A,(—24). 
It then follows from (6.2) that 
(6.4) = A,(4) — A,-1(4), 
4u(u + 1) : 
and 
2 
A,(t) = ———— A, + 
(¢) + 1) 1(t) A, (4) 
4u(u — 
| + a(t). 
Thus, using (6.4) pa: we have 
| 2 
4u(u — 1) 
| + A,-i(A,-1 — A,-s), 
and 
2 4u(u = 1) 
| A, — = Ay—1(Ay—1 — 
poi 4u(u + 1) 
(6.5) A,-2{ Ay A,-1} 
4u(u—1) 4u(u — 1) 
If we put 


2 
D,(t) = Ay — 
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} 
| 

and 


OTTO SzAsz 


2 
A, (2) = Ay Ay—1Ay+1, 


then we have 


4u(u — 1) 
(6.6) D, = —— A,-1, 
i? 
and 
p-—l 2 2 
4, =D, ‘ 
Now from (6.5) we get 
— 1) 8u(u—1) 2 
u(t) (u 1 ) (u 1 ) 
hence 
2 2(u—1) 2(u—1) 2 
t D,(t) BD 


It follows that the sequence 

P(u + v)Dy+o(t) 
+y — 1) 


is monotone decreasing as v increases. Furthermore 


Qu+r(t) 


y=0,1,2,--- 


2 8 
D,(t) = Ai(é), D2(t) = Al. 


It is known‘ that 


| Ay(é) | A,(0) = 1, t real; 
hence 
and, from (6.7), 
2 
t A, »(t) 
= = i. 


— 


“See S. Minakshisundaram and Otto Sz4sz, On absolute convergence of multiple 
Fourier series, Trans. Amer. Math. Soc. vol. 61 (1947) pp. 36-53, in particular 
formula (2.3). 
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the series being uniformly convergent. It now follows that 

D,(t) > 0, for uw > 1, ¢ real; 
hence, from (6.6), we have 

A,-1(¢) > 0, 1, 
or, in terms of the Bessel functions, 


J ult) — > 0, real. 
ut+i 
If, for example, u.=3/2, we get 
t sin* — 3 cos «(sin — cost) > 0 fori > 0. 
We state the main result of this section as: 


THEOREM 2. Jf J,(t) denotes the Bessel function of index p, defined 
by (6.1), then, for u>0 and all real values of t, 


| 
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ON BOUNDED ANALYTIC FUNCTIONS 
ZEEV NEHARI 


The objective of this paper is to give an alternative derivation of 
results on bounded analytic functions recently obtained by Ahlfors 
[1] and Garabedian [2].! While it is admitted that the main idea to 
be used is more in the nature of a lucky guess than of a method, it 
will be found that the gain in brevity and simplicity of the argument 
is considerable. As a by-product, we shall also obtain a number of 
hitherto unknown identities between various domain functions. 

The basic problem treated in the above-mentioned papers is the 
following generalization of the classical Schwarz lemma: Given a 
finite schlicht domain D of connectivity ” (n21) in the complex z- 
plane and a point ¢ in D, to find a function F(z) with the following 
properties: (a) F(z) belongs to the family B of analytic functions f(z) 
which are single-valued and regular in D and satisfy there | f(z)| <1; 
(b) | F’(6)| =|’), where f(z) is any function in B. Evidently, it is 
sufficient to solve this problem for any domain D’ which is con- 
formally equivalent to D. In particular, we may therefore assume, 
without restricting the generality of what follows, that D is bounded 
by analytic curves. 

It was shown by Ahlfors that F(z) yields a (1, 2) conformal map- 
ping of D onto the interior of the unit circle and that »—1 of the n 
zeros of F(z) coincide with the zeros of a single-valued function h(z) 
which is regular in D with the exception of a simple pole at z=¢ 
and satisfies —ih(z)dz>0 on the boundary I of D; the mth zero of 
F(z) is located at z=f¢. It was subsequently noticed by Garabedian 
that the function h(z) can be written in the form h(z) = F(z)q(z) 
where q(z) —(z—¢)~? is regular in D and that the extremal property 
of F(z) can be deduced in a very elegant manner from the resulting 
inequality 


1 
(1) — F(z)q(z)dz > 0, zEr. 
i 


Indeed, if f(z) is in B and is continuous on I’, we have, by the residue 


theorem, 
s | g(z)ds | = | F(z)q(z)ds | 
2rJr 2x Jr 


Received by the editors October 19, 1948 and, in revised form, January 8, 1949. 
1 Numbers in brackets refer to the references cited at the end of the paper. 
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1 
f F(2)q(2)dz = 


where (1) and the fact that, on I’, | F(z)| =1 have been used. 

The inequality (1) not only solves the problem stated above but, as 
likewise shown by Garabedian, it may also be used in order to estab- 
lish the connection between F(z) and the Szegé kernel function [4] of 
D. This connection, in turn, leads to a possibility of effectively com- 
puting F’(¢) in terms of a certain complete orthonormal system of 
functions in D. 

Everything depends therefore on the proof of the existence of two 
functions F(z) and g(z), where g(z) has the required double pole and 
| F(z)| =1 on TI, such that (1) is satisfied for z€IT’. We shall carry 
through this proof by giving, with the help of an allied simple ex- 
tremal problem, a construction of these functions in terms of the 
Green’s function and the harmonic measures of D. 

We use the following notations: p(z)=p(z, ¢)=g(z, 
denotes the analytic function whose real part is the Green’s function 
g(z, ¢) of D, that is, the harmonic function which is regular in D, 
apart from the point z={ where g(z, ¢)+log |s—¢| is regular, and 
vanishes for z€ D. w,(z) =w,(z) +4; (z) denotes the analytic function 
whose real part is the harmonic measure w,(z) of the boundary com- 
ponent I’, (v=1, 2, - - - , m), that is, that harmonic function which is 
regular in D, is equal to 1 for z€T, and vanishes on all the other 
boundary continua. We further use the notations u(z)=u(z, §) 
=0p(z, £)/0E, v(z) =v(z, =(1/2)0p(z, £)/dn, It is easily 
confirmed that, with the exception of a simple pole of residue 1 at 
z=, both u(z) and v(z) are free of singularities in D and that, 
furthermore, Re { =Im {v(z, ¢)} =0 for 

Since the real parts of the functions p(z), w,(z), u(z), iv(z) are con- 
stant on I, the differentials 


1 1 
— p'(z)dz, —wy(z)dz, —u'(z)dz, v'(z)dz 
t 1 
are real there. The same is therefore also true of a linear combination 


1 
— t(z)dz 
1 
(2) 
=— cs 2ru’(z) + isin 2A0'(z) — ap’(z) + W, | dz 


1 


v=1 


with real A, a, , Since consists of analytic curves, the 
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functions u(z), v(z), p(z), w,(z) are regular on !—an immediate conse- 
quence of the Schwarz reflection principle. It is further well known, 
and easily proved, that ip’(z)dz is not only real but also positive on 
I’. If, in (2), A, wa, are kept constant and a (a>0) is taken 
large enough, the differential #(z)dz will therefore satisfy 


1 
(3) — t(z)dz = 0, cer. 
1 


We now pose the following extremal problem: For given X, to find 
the differential (2) which minimizes a under the condition (3). It was 
shown by the author [3] that this problem has a solution, say 
4-4 (z)dz, and that the function 4(z) belonging to this extremal dif- 
ferential has a double zero on each boundary component [I,, v 

It is readily confirmed that, in the vicinity of z=¢, the function 
4(z) has the expansion 

ez 


(4) h(s) = 


+ regular terms. 

We shall now show that &(z) is free of zeros in D. In view of (3), the 
expression {(z)dz does not change its argument if z describes the 
boundary TI of D. If A arg & denotes the total change of the argument 
of 4 along I, and A arg dz has a similar meaning, we have therefore 


A arg 4 = — A arg dz. 
Since A arg dz= —2r(n—2), it follows that 


(5) aA arg 4, = n — 2. 
By the argument principle, this is equal to the number of zeros of & 
in D minus the number of its poles there, plus a contribution of 1/2 
for each zero of 4 on I’. Since & has m double zeros on I and a double 
pole at z=¢, it follows from (5) that & has no zeros in D and, more- 
over, that 4 does not vanish on I at points different from the m double 
zeros already located. 
Our next step is to prove that the function 
pid 


(6) or(z) = = — 


--+ regular terms 


is single-valued in D. For this purpose we have to show that, in addi- 
tion to the fact just established that o,(z) has no zeros in D, the argu- 


$ 
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ment of the function o,(z) returns to its initial value if 2 describes a 
boundary component I, (v=1,---, ) of D. This is equivalent to 
showing that the increment of arg 4 along I, is an even multiple of 
2m. That this is indeed the case is easily shown as follows: If we start 
from the point on I’, at which the double zero of ft, is located and 
describe I’, in the positive sense with respect to D, the total variation 
of arg 4 is, in view of (3), equal to the negative value of the total 
variation of arg dz, that is, it is either 27 or —2z. At the double zero 
of &, arg & jumps by 2z. The variation of 4 along the whole of I, 
is therefore either 0 or 47. Hence, the function o,(z) defined in (6) 
is indeed regular and single-valued in D. 

We now introduce two functions K,(z, ¢) and Z,(z, ¢) by the defi- 
nitions 


ie® 
K(z, = Ki(z, = — (or + 
4x 
(7) 
L(z, = = (or — 
4ri 


Both functions have also been written without the subscript A, in 
anticipation of the fact—to be proved further below—that they are 
independent of the parameter X. It is easily confirmed that K(z, ¢) 
is regular in D and that L(z, ¢) has a simple pole at z=f{ with the 
residue (27)-!. 

For z€I, the two functions K(z)=K(z, ¢) and L(z)=L(z, £) are 
connected by the relation 


1 
(8) — L(z)dz = K*(z)ds, 
1 
where ds = | dz| is the length element on [’.? We shall prove (8) by 


showing that both the arguments and the absolute values on both 
sides are equal. We have, by (6), (7) and (3), 


1 1 9 
(9) — K(z)L(2)ds = + or+2/2)ds = 0, 
16x? 
whence 
1 
arg — L(z)dz = — arg K(z) = arg K*(z) = arg K*(z)ds, 
1 


which shows that the arguments are equal. Again by (7), we have 


2 The asterisk denotes conjugate complex quantities. 


1 
1 
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(10) - +i ( - i) 

L(z) Dr4+”/2 /2 
In view of (3) and (6), we have 

2 1/1)t,dz 
(11) (—-) - 

As a result, 0,0; ‘;2/2 is real on I’, whence, by (10), |KL-| =1 for 


z€I. This completes the proof of (8). 

As a first application of (8), we shall show that K(z) and L(z) are 
independent of \ and that we were therefore justified in dropping 
the subscript A in (7). For suppose there are two different pairs of 
functions K;, L; and Ke, L2 belonging to different values of \. Obvi- 
ously, the function Z,—J, is regular in D+I, whence, by Cauchy’s 
theorem, 


[Li(z) — Lo(z)][Ki(z) — Ka(z) = 0. 


In view of (8), this is equivalent to 


f | — Ka(z) = 0, 
r 


whence K,(z) =K.2(z) on I’ and therefore throughout D. The identity 
L,(z) =L.(z) follows by another application of (8). 

The fact that both K(z) and L(z) are independent of \ leads to an 
interesting identity. With the notations 


(12) a(z) = o9(z), = o4/2(z) 
we have, by (7), 

+ = o + ir 

(0, — = — it 
whence 
(13) o(z) = cos Ao(z) + sin Ar(z). 

Consider now the function 
T(z) = o(z)/r(z). 


By (11) and (12), T(z) is real on I. At interior points of D, T(z) can- 
not be real. Indeed, suppose T(z) = —tan Ao, where A» is an arbitrary 
real number between —2/2 and 2/2. This would entail cos Xoo(z) 
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+sin Aor (z) =0. By (13), this would mean that o,(z) has a zero in D, 
which further above was shown not to be true. The same argument 
also shows that every real value is taken by T(z) on I exactly n 
times, this being the number of zeros of o,(z) on I’. Since T(z) is real 
on I but at no interior point of D, it follows therefore that w= T(z) 
maps D onto an times covered half-plane which is bounded by the 
real axis. If, in (6), the positive value of the square root is taken, we 
have, by (6) and (12), T(¢) = —i, which shows that w= T(z) yields a 
(1, 2) mapping of D onto the lower half-plane, Im{w} <0. Conse- 
quently, the function 

T(z) —i o(z) — ir(z) L(z) 
yields a (1, m) mapping of D onto the unit circle. This, together with 
(9) and the argument principle, incidentally shows that K(z) has 
n—1 zeros in D and that L(z) is free of zeros there. 

If we introduce the function g(z) by 


(15) = 4x°L*(z) 
we have, by (14) and (9), 


1 4x? 
(16) — F(z)q(z)dz = —— K(z)L(z)dz = 0, zEr. 
i t 


Comparison of (16) with (1) shows that F(z) and g(z) are identical 
with the functions F(z) and q(z) introduced there, provided we can 
show that the principal part of g(z) at z=¢ is (s—{)-*. This is in- 
deed the case. For suppose the principal part of g(z) at this point is 
(z—f)-?+7(z—f)-'. By the residue theorem and in view of | F(z) | 
=1 we have 


1 
wt 
By virtue of (16), this may also be written 


= F*(z)q(z)dz. 
2riJr 

Since F(z) has a zero at z={—originating from the simple pole of 

L(z)—the integral at the right-hand side vanishes by the Cauchy 

theorem, whence y = 0. This completes the proof of the identity of the 

functions F(z) and g(z) with the functions—denoted by the same 

symbols—introduced at the beginning of this paper. 
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It was shown by Garabedian that the function (27)~' F(z) [q(z) ]¥? 
is identical with the Szegé kernel function [4] of D. In view of (14) 
and (15), this function is identical with K(z). The identity of K(z) 
= K(z, ¢) with the Szegé kernel function is also easily shown with the 
help of (8). Indeed, if f(z) is a function which is regular and single- 
valued in D and continuous in D+TI, we have, by (8) and the 
Cauchy theorem, 


1 
(17) Rye — f f(z)L(2)dz = f K*(z, ¢)f(s)ds, 
4 T 


and this relation is characteristic of the Szegé kernel function. 

The Szegé kernel function K(z, [) can be effectively computed in 
terms of a certain complete orthonormal set of functions [4]. If 
K(z, £) is known, L(z) = L(z, ¢) can also be immediately determined. 
We have, by (8) and the residue theorem, 


K*(t, 1 Lit, ¢)dt 1 


whence 


1 
(18) 2xL(z, ¢) = —— + 


K*(t, 


Once the kernel function is known, we are thus in a position to de- 
termine explicitly the functions F(z) and q(z), as well as the functions 
o,(z) mentioned before. 

If we combine (2), (6), (7), (13), (14), (15), (18), a large number of 
interesting identities connecting K(z), L(z), F(z), q(z), o,(z) with the 
Green’s function and the harmonic measures can be obtained. As an 
example, we prove the identity 


2rK(z, £)L(z, £) 
= p'(z, 5) + yw, (2), 


where, as in (2), Re { p(z, ¢)} is the Green's function of D, Re { w,(z) } 
the harmonic measures of I, (v=1, - - - , m—1) with respect to D, 


and the ¢, are suitable real constants. Indeed, by (9), (6), (2), and 
(12) 


(19) 


2eK(z, o*(z) + 7°(z) 
= Cp’ ’ yt z), 


y=] 


where C, C2, , are constants. Since the residue of the simple 


— 
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wn 


pole of L(z, £) at z= is (2x7)—', the value of constant C is immedi- 
ately seen to be 1. This completes the proof of the identity (19). 

The constants c, in (19) can be expressed in terms of the harmonic 
measures w,(z), their periods P,, with respect to the boundary com- 
ponents I',, and the kernel function. By (8), we have 


f xe )L(z, = ff | K(z, |*ds. 
Ty Ty 


On the other hand, 


f w; (z)dz 


In view of (19), the constants c, are therefore the solutions of the 
system of linear equations 


and 


n—1 
| K(z, |*ds =o (5) + Puc, w=i,---,n—1. 
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NONLINEAR DIFFERENCE EQUATIONS 
CONTAINING A PARAMETER 


F. STEPHENS 


1. Formulation of the problem. Consider the system of nonlinear 
difference equations having the form 


(1) j=l 
f(0,---,0;0; x) =0 (c= 1,---,n), 


where the f; are analytic functions of y;(x) and the constant param- 
eter », continuous functions of the complex variable x, and bounded 
for all values of the y,(x), p, and x for which 


The f; contain terms in p alone and can be developed into convergent 

power series in the y;(x) and p. The 0;;(x) are any continuous and 

bounded functions of x which are defined in the domain G. We could 

have taken the parameter to be a periodic function of period 1. 
In other words, we may write 


(2) fi = gilx, + hilyr(x), , yn(x); 2), 


where 


(2a) ~) = + 


j=? 
and 
= (ao Mm; a0, ++, an an t+--- +a, = m). 
The A,;;(x) (j=1, 2, ---) and are any continuous and 


bounded functions of x such that the right members of (2a) and (2b) 
couverge, where the functions A(x) can conceivably be identically 
zero. 

In a recent work! the author made certain investigations concern- 
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ing systems of nonlinear difference equations similar to system (1). 
However, the functions in the right members of these systems cor- 
responding to the f; are more restrictive as functions of x. A system 
similar to (1) in which the functions corresponding to the 0b;;(x) are 
more restrictive as functions of x was first considered by the author 
in his doctoral dissertation, which was prepared at the University of 
Michigan under the supervision of J. A. Nyswander. Also, the results 
contained in the first two chapters of the author’s paper to which 
reference is made, with slight changes, were obtained in his doctoral 
thesis, Theorem 6 being a generalization of a theorem considered 
there. The procedure for investigating solutions of system (1) which 
will be given in this paper is different from and simpler than the one 
considered in the author’s dissertation. 


2. Preliminary results. The proof of the existence of a solution of 
system (1) which is analytic in p and continuous in x is based on the 


following result. Let us consider the system of nonlinear difference 
equations 


2i(x + 1) = , a(x); x), 
1(0,---,0; p; x) = a(x, p) #0 (¢=1,---,), 


where the /; are analytic functions of the 2;(x), continuous functions 
of x and #, and can be developed into convergent power series starting 
with terms independent of 2;(x). 

Let C be a circle with center at the origin and radius R’(=R-+1). 
Draw tangents ¢ and ?’ to the circle C at the points where the circle 
cuts the axis of imaginaries. Denote by D all points not interior to 
the domain bounded by that part of the circle C which lies to the left 
of the axis of imaginaries and the parts of tangents ¢ and t’ which do 
not lie to the left of this axis. Thus the domain D contains no part of 
the positive axis of reals. 

The following lemma is similar to a result obtained previously by 
the author,” and can be proved by similar methods: 


(3) 


Lema. If we can determine positive constants d, R, and r such that 


1(21(x), , x) | < _r/4n 
for 


then the system of equations (3) has a solution having the form 


2 Ibid. §9. 


| 
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where the right members of equations (4) are absolutely and uniformly 
convergent in x and p for x in the domain D and for |p| =X. 
For the proof, a new system of equations 
(5) + 1) = , an(x); x) (§=1,---,m) 


is introduced, containg a new parameter ¢. A solution 


k=l 
of this system is computed by substitution into (5) and comparison of 
like powers of ¢t. By a method of dominant analytic implicit equations 
the series is shown to converge for =1, and when ¢ is set equal to 1 
a solution of (3) is obtained in the form (4). 


3. Proof. We shall now prove the existence of a solution of system 
(1) which is analytic in the parameter p and continuous in x. In order 
to accomplish this aim, we shall transform the system of equations 
(1) into a system such that the conditions of our lemma are satisfied. 
Let us make the transformation 


(6) yi(x) = (¢=1,---,m). 


Then the system of equations (1) is transformed into the system 


m=2 
= 1;(z:(x),- , 2n(x); p; x) (i= 1,---,n). 


Let r be the smallest of the numbers 7; and let » be the smaller of 
the numbers p and 1. Then we can find positive constants B, E and 
H such that 


for |z:(x)| Sr, |p| Su, |x| 
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We shall take \ to be the smaller of the numbers yp and (1/3(E+A)) 
-(r/4n). Also, take R to be the larger of K and 2B(4n/r). Then 


| , 2n(x); x)| S| p| (E+ + (1/|2|)B 
< (1/3)(r/4n) + (1/2)(r/4n) < r/4n, 


for |2:(x)| <r, |p| SA, |x| SR. 

It follows that the system of equations (7) has a solution of the 
form (4). Also, we know from the properties of the /; and the method 
of computing the C{(x, ») that each term in the right members of 
equations (4) is analytic in » and continuous in x. Therefore, it fol- 
lows that these solution-functions of equations (7) are analytic in p 
and continuous in x, since the right members of equations (4) con- 
verge absolutely and uniformly in x and >. Finally, we obtain from 
these results and equations (6) that the solution-functions thus ob- 
tained for equations (1) are analytic in p, continuous in x, and 
vanish identically for » equal to zero. 


4, Example (x =1). Consider the equation having the form 
(8) + 1) = + f(y(*); x), 0; x) = 0, 


where 


(9) f = g(x, p) + A(y(x); p; x), 
(9a) g(x, p) = 
(9b) h(y(x); p; x) = Do y™(x). 


The power series (9a) and (9b) converge and the function b(x) is 
bounded and continuous for all values of y(x), p, and x for which 


Gi: | y(x)| Sr <1; lp| Sp <1; |x| = K. 
Let us make the transformation 
(10) y(x) = p2(x). 


Then (8) is transformed into the equation 


a(x + 1) = + + PD 
m=2 


jm? 


= I(2(x); p; x). 


| 

| 

= 
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The series Doa<22"(x) converges for | 2(x)| <=r<i and dominates 
the series Doa-2)"-22"(x), since | p| <1. Also, the geometric series 
> }-2*-? converges for | p| <1. Therefore, we can find positive 
constants B, E, and H such that 


| b(x)2(x)| B,| p*?| < E, 
j=2 
< A, 
for 
|plso; 


Take A to be the smaller of the numbers p and (1/3(E+H))(r/4). 
Also, take R to be the larger of the numbers K and 2B(4/r). Then 


| x) | S$ (1/| «|)B+|p| 
S (r/2B(4))B + (1/3(E + H))(r/4)(E + H) < 1/4, 
for 
Sr; [pl sr; 
We have 


p= 


j=2 
b(x — 1) 
p) = (p+ ptt ptt ---) 
(x — 1) 
’ (x — 1)(x — 2) p + ’ 


Therefore, we can write 
a(x) = gi(x)p + go(x)p? + ---, 


where 


gi(x) = 1+ 


b(x — 1) b(x — 1)b(x — 2) 
ae (x — 1)(x — 2) ie 
b(x — 1)--- b(x — n+ 1) 


Finally, we have 


| 
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y(x) = gi(x)p? + go(x)p?+---. 


Direct substitution of this value for y(x) in (8) shows that g:(x) must 
satisfy the nonhomogeneous linear difference equation 


b(x) 


x 


g(x+i1)=1+ g(x). 


We have 
HO 
x x 1) 
(x) b(x — 1) --- — n+ 2) 
x (x—1)---(*#—2+ 2) 
b(x) b(x — 1) 
(1 


gi(x + 1) 1 


1+ 


x 


b(x — 1)--- d(x — n+ 1) ) 
—1)---(#—n+ 1) 
b 
=i+ gi(x). 
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A THEOREM IN FINITE PROJECTIVE GEOMETRY 
AND AN APPLICATION TO STATISTICS 


ESTHER SEIDEN 


R. C. Bose showed that certain statistical problems of design of 
experiments can be attacked fruitfully by interpreting the statistical 
terms involved in terms of finite geometries. In particular, the 
geometrical interpretation proved useful when applied to the prob- 
lem of determining the maximum number of factors which can be 
accommodated in symmetrical factorial design without confounding 
the degrees of freedom belonging to interactions of a given order or 
lesser. 

In a paper Mathematical theory of factorial design (Sankhya vol. 8 
(1947) pp. 107-166) R. C. Bose proved that the maximum number, 
m,(r, s), of factors which can be accommodated in symmetrical fac- 
torial design in which each factor is at s = p” levels (p being a positive 
prime and 2 a positive integer) and each block is of size s’, without 
confounding any degrees of freedom belonging to any interaction 
involving ¢ or lesser number of factors, is given by the maximum 
number of points of a finite projective space PG(r—1, s) (of r homo- 
genous co-ordinates each of which is capable of s values) such that 
no ¢ of the chosen points are conjoint (lie in a space of :—2 dimen- 
sions). Furthermore, R. C. Bose proved in the above mentioned paper 
that m;(4, s)=s?+1 when s is a power of an odd prime and s?+1 
<m;(4, s)Ss?+s+2 when s=2", n>1. This inequality gives, for 
s=4, 17S m,(4, 4) S22. It is the purpose of this paper to prove that 
m;(4, 4)=17. In other words: the maximum number of points of 
PG(3, 4) such that no 3 of them are collinear cannot exceed 17 
(Theorem 5). 

In order to make the reading of this paper independent of the 
literature on finite geometry we explain shortly the PG(3, 4). 
The points in PG(3, 4) can be represented by attaching to it a 
Galois field with 4 elements, GF(4), built with the help of an ir- 
reducible polynomial, e?+-¢+1=0, where a point is understood to be 
an ordered quadruple of elements of GF(4), not all zero. Two points 
(x1, X2, X3, Xs) and (1, ye, ys, ys) are defined to be identical if there 
exists a nonzero element of GF(4), say p, such that x;=py;, i= 1, 2, 3, 4. 
Thus, the total number of points is seen to be (44—1)/3=85. By a line 
passing through two different points (x1, x2, x3, x4) and (¥1, ye, Vs, V4) 
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is understood the set of points (uxi+Ay1, ux2+Aye, UXs+Ays, 
where y, \ are elements of GF(4). It is seen that the number of points 
on a line is 5. Defining, similarly, what is meant by a plane, the 
number of planes passing through a line will be 5 and the number of 
points on a plane 21. Finally, the remark that the observations usual 
in projective geometry concerning the choice of the coordinate system 
hold true in PG(3, 4). 
The following two theorems are known to hold in PG(3, 4). 


THEOREM 1. The three diagonal points of a nondegenerate plane 
quadrangle are distinct and lie on a line, say L. 


The proof follows by taking coordinates such that the points of the 
quadrangle are (0, 1, 0, 0), (0, 0, 1, 0), (0, 0,0, 1), (0, 1, 1, 1). In this 
case the diagonal points are (0, 1, 1, 0), (0, 1, 0, 1), (0, 0, 1, 1) and 
the equation of L becomes evidently x;=0, x2+x3+x,=0. 


THEOREM 2. In every plane there are 6 “no three collinear” points. 


(0, 1, 0, 0), (0, 0, 1, 0), (0, 0, 0, 1), (0, 1, 1, 1), (0, 1, «, €%), (0, 1, e, €) 
are six such points of the plane x;=0. Evidently the last two points 
lie on L and differ from the above-mentioned three diagonal points. 
Moreover, together with them they exhaust the points of L. 


THEOREM 3. Any 4 no three collinear points Q:, Qe, Qs, Qs in a plane 
determine uniquely 2 more points, say Qs, Qe, such that Qi, Qo, ---, Qs 
are no three collinear points. 


Let us consider the five lines in the plane passing through Q;. They 
exhibit all the points of the plane. Three of them coincide with the 
lines determined by the pairs of points QiQ2, Q:03, Q:0s. Each of the 
remaining two lines meets the lines Q2Q3, Q2Q:, Q3Q4 in three distinct 
points. Thus on each of these two lines there remains just one point, 
say Qs, Qc, respectively which form together with Qi, - - - , Qs a set 
of no three collinear points. 


DEFINITION. Any set of 6 no three collinear points of a plane will 
be called 6-basic. 


THEOREM 4. Consider two planes p, and p2 intersecting in a line de- 
termined by two points U;, U2 and 4 more points Q:, Q2, Qs, Qs of pi and 
R, Re, Rs, Rs of p2 so that U;, U2, Q1, Qs, Qs, Os as well as U;, U2, Ri, 
Re, R3, Ry are 6-basic. Then, 

1. The 16 lines Q;R; (¢, 7=1, 2, 3, 4) are distinct. 

2. They can be distributed into four sets of 4 lines so that the 4 lines of 
each set intersect in exactly one point. 
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3. If Q:Q; and RiR,» are coplanar, then they meet the line U,U2 in 
the same point. 

4. The four points of intersection of the lines Q;R;, say O1, O2, Os, Os, 
lie on a plane, say ps, passing through U,U2. 

5. U1, U2, O1, O2, Os, O4 are 6-basic and the role of the sets of Q's 
or R’s or O's can be interchanged. 


ProorF. 1. If a line \ given by Q;R; were identical with that given 
by Q:R:, then, assuming without loss of generality i~k, it would 
follow that points Q;, Q;, R; lie on \ and consequently that R; is in 
against the assumption. 

2. There exists a two-to-two correspondence between the 6 lines 
Q:Q; and the 6 lines R;,R; if we associate with line Q;Q; any of the 
two lines R,R; which pass through the same one of the three points of 
L other than U;, U2. By this correspondence the points of a given tri- 
angle Q;0,Q, are made to correspond to those of an arbitrary triangle 
R.R:iR,, in a unique way. For Q,Q; corresponds to at most one side, 
say R,R:; QiQ, corresponds to another side, say R,R,,; hence the inter- 
section Q; of Q;Q0; and Q,Q, corresponds to the intersection of RR: 
and R;R,, which is R,; and accordingly Q; must correspond to R:, Q, 
to R,,; it is seen that also line Q;Q, corresponds to line R:R,, because 
these lines must pass through the remaining one of the three points 
of L other than U;, U2. Now we apply the converse of Desargue’s 
theorem to the triangles Q;0;0, and R,R:R,, and obtain that the lines 
Q:R:i, Q;Ri, Q.Rm pass through one point, say O; furthermore, if Q’ 
is the fourth point of the Q set, R’ the fourth point of the R set, then 
Q’R’ passes also through O; for the triangle Q;0;Q’ corresponds to 
R,R:iR’ necessarily point by point in the same order since line Q,Q’ is 
different from Q;Q; and Q;Q,, hence these three lines intersect with L 
at three distinct points other than U;, U2, thus determining the inter- 
section of L with Q;Q’ by exclusion, and the same holds for R;R’. 
Applying the converse of Desargue’s theorem to the triangles Q;0;Q’ 
and R;R:R’, we find that Q;R:, Q;Q:, Q’R’ pass through a point which 
necessarily is O. Making any of the four triangles formed by 3 out of 
the four points Ri, R2, R3, Rg correspond to the triangle Q:, Qe, Q3 in 
the indicated manner, we obtain 4 points Oi, Oz, O3, O, as points of per- 
spectivity. They are distinct, for if, for example, the lines QR, 
Q2Re, OsRs, meet at then a line Q;R;, where cannot pass 
through O, because then R;, R;, Q; would be collinear and conse- 
quently Q; would lie on the plane p2 against the assumption. 

3. Two coplanar lines Q;Q; and R:R, meet the line L (intersection 
of the planes p; and #2) in the same point, because otherwise the planes 
p; and 2 would be identical against the assumption. 


} 
) 


1950] A THEOREM IN FINITE PROJECTIVE GEOMETRY 285 


4. Given any two points of the set O;, O2, O03, Ox, say O1 and Oz, 
there exist two points Q;, Q; and two points R;, R,, such that the line 
Q:R: meets Q;R,, in O, and Q;R,, meets Q;R; in O2. Take Q; arbitrary, 
determine R; so that the line Q;R; passes through O,; determine R,, 
so that Q;R,, passes through O2; then O2R; must pass through 
another Q, say Qj. Qi, Q;, Ri, Rx form evidently a nondegenerate plane 
quadrangle and thus the line Q,Q; meets the line R,R,, in a point P 
and, by Theorem 1, O;, O2, P are collinear. But P lies, by 3, on the 
line L, thus O1, O2, U:, U2 are coplanar. The same holds for any 
0;, O;, U1, Us, showing that all O1, Oz, O3, O, lie in a plane through 
Ui, U2, say 

5. Line U;U2 does not contain an O; furthermore any line O0;0; 
meets the line U, U2 at a point P different from U; and U2, so that no 
U lies on a line joining two O’s and no O lies on a line joining two U’s. 
It remains to show that no three of the O’s are collinear. Assume with- 
out loss of generality that 


O, is intersection of with Q2R2, 
Oz is intersection of with Q2Ri, 


Os is intersection of with 


Now if O,, Oz, O; were collinear, then Ri, Re, R3 would be obtained as 
intersections of their plane (which does not contain Q,) with the plane 
Q:0,0,0;; hence Ri, R; would be collinear against assumption. 
Thus no three O’s are collinear. It is seen that, under the assumption 
made above, Q; is the intersection of the lines O2Re, O3R3, 
and is the intersection of 01Q:, 0202, 0303, 010s. Evidently similar 
results follow for the remaining 3 groups of 4 lines O;R; or O;Q; re- 
spectively. Thus the réles of the 0's, Q’s, and R’s can be interchanged. 


THEOREM 5. The maxin:um number of points which can be chosen in 
PG(3, 4) such that no three of the chosen points are collinear does not 
exceed 17. 


Proor. Let C be the set consisting of the maximum number of 
points no three collinear in PG(3, 4). Assume without loss of gen- 
erality that Ui, Uz belong to C. The proof is trivial in case when, 
among the 5 planes passing through U; U2, there do not exist 2 planes 
such that one of them contains 4 points, other than Ui, U2, belonging 
to C and the other plane at least 3 such points, because in this case 
the total number of points belonging toC would readily be seen to be at 
most 17. Hence assume without loss of generality that Q:, Qe, Qs, Os 
as well as Ri, Re, R; belong to C. Consider two cases (1) Ry belongs 


} 
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to C, (2) Ry does not belong to C. 

In case 1 the 16 distinct lines R,Q; (¢, 7=1, 2, 3, 4) contain 16 points 
of each of the two planes, say /4, ps, other than /;, po, p3. Since the 
remaining 5 points of these planes belong to the line L, no points of 
these planes other than U:,U2 belong to C. C can contain at most 4 
additional points of 3; accordingly, the total number of points in C 
amounts to 14 (2+34=14). 

In case 2, the points of p, and ps which belong to C are on the lines 
Q:R, (¢=1, 2, 3, 4). Since the Q’s belong to C by assumption, there 
are at most 4 more points of these lines belonging to C. In addition at 
most 4 points other than Ui,U2 of p; belong to C. Thus in this case 
the total number of points belonging to C is at most 17 (2+3+43 
X4=17). 

REMARK. It can be proved that in case 2 the total number of points 
belonging to C is 13. 

Theorem 5 together with the inequality m;(4, 4)217 proved by R. 
C. Bose gives the required equality, namely m;(4, 4) = 17. 

It is a pleasure to record the author’s indebtedness to Professor R. 
C. Bose for suggesting this problem and to Professor Hans Lewy for 
frequent stimulating discussions which helped to prepare this paper. 

It was proved after the paper was submitted to the editors that 


every set of 17 no three collinear points in PG(3, 4) forms a quadric 
surface. 
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